Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usage(huggingface.co)4 points by cubie 2 years ago | 0 commentsNo comments yet