TurboQuant

TurboQuant

New LLM compression algorithm by Google

451 followers

A set of advanced theoretically grounded quantization algorithms that enable massive compression for large language models and vector search engines.