ZongwuWang / MILLION
This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization" (DAC'25).
23 · Apr 2, 2025 · Updated 11 months ago

Alternatives and similar repositories for MILLION

Users who are interested in MILLION are comparing it to the libraries listed below.
