henryzhongsc / longctx_bench
Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024
☆76Updated 2 months ago
Alternatives and similar repositories for longctx_bench:
Users that are interested in longctx_bench are comparing it to the libraries listed below
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆80Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of "Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exitin…☆54Updated 10 months ago
- ☆49Updated 11 months ago
- ☆40Updated 5 months ago
- GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM☆161Updated 9 months ago