HugoZHL / PQCache

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference
33Updated 2 weeks ago

Alternatives and similar repositories for PQCache:

Users that are interested in PQCache are comparing it to the libraries listed below