Hugging Face always has good blog posts too: https://huggingface.co/blog/embedding-quantization#retrieval-speed