HugoZHL / PQCache

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference
37Updated last month

Alternatives and similar repositories for PQCache:

Users that are interested in PQCache are comparing it to the libraries listed below