AmesianX / TurboQuantView on GitHub
TurboQuant KV Cache Compression for llama.cpp — 5.2x memory reduction with near-lossless quality | Implementation of Google DeepMind's TurboQuant (ICLR 2026)
60Apr 26, 2026Updated this week

Alternatives and similar repositories for TurboQuant

Users that are interested in TurboQuant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?