ZongwuWang / MILLIONLinks

This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization" (DAC'25).
11Updated 3 months ago

Alternatives and similar repositories for MILLION

Users that are interested in MILLION are comparing it to the libraries listed below

Sorting: