ZongwuWang / MILLION
This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization" (DAC'25).
23 · Apr 2, 2025 · Updated last year

Alternatives and similar repositories for MILLION

Users that are interested in MILLION are comparing it to the libraries listed below.
