HugoZHL / PQCache

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference
44Updated 3 months ago

Alternatives and similar repositories for PQCache:

Users that are interested in PQCache are comparing it to the libraries listed below