ruikangliu / IntactKVLinks
[ACL 2024] Official PyTorch implementation of "IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact"
☆48Updated last year
Alternatives and similar repositories for IntactKV
Users that are interested in IntactKV are comparing it to the libraries listed below
Sorting:
- The Official Implementation of Ada-KV [NeurIPS 2025]☆118Updated 2 weeks ago
- QAQ: Quality Adaptive Quantization for LLM KV Cache☆55Updated last year
- [NeurIPS 2024] The official implementation of "Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exitin…