ZongwuWang / MILLION

This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization" (DAC'25).
23 · Apr 2, 2025 · Updated 10 months ago

Alternatives and similar repositories for MILLION

Users who are interested in MILLION are comparing it to the libraries listed below.
