[ ] commonplace
Browse Log in Get started
Browse Log in Get started
← Back to blog
Bookmark Tuesday, January 6, 2026

Quantized Retrieval - a Hugging Face Space by sentence-transformers | Tom Aarsen | 10 comments

→ linkedin.com
"You can perform 200ms search over 40 million texts using just a CPU server, 8GB of RAM, and 40GB of disk space. The trick: Binary search with int8 rescoring."
← Previous Stoic (@Stoiiic) on X
Random
Next → Did filesystem tools kill vector search?

Collected over time.

· v0.15.0 (64ba6d7)
RSS Subscribe