[ICLR 2025๐ฅ] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
โ27Jul 7, 2025Updated 10 months ago
Alternatives and similar repositories for D2O
Users that are interested in D2O are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- โ47Nov 25, 2024Updated last year
- โ43Oct 16, 2025Updated 7 months ago
- โ38Mar 17, 2025Updated last year
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attentionโ53Aug 6, 2025Updated 9 months ago
- The Official Implementation of Ada-KV [NeurIPS 2025]โ131Nov 26, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".โ17Sep 15, 2024Updated last year
- This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago โฆโ14Jan 19, 2022Updated 4 years ago
- โ314Jul 10, 2025Updated 10 months ago
- โ47Mar 15, 2025Updated last year
- ๅไบซไธไบS2Sๅจๅฎ้ ๅบ็จไธญ้ๅฐ็้ฎ้ขๅ่งฃๅณๆนๆณใโ28Aug 3, 2020Updated 5 years ago
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extractionโ17Dec 23, 2021Updated 4 years ago
- [ACL 2026] Repository of IPBenchโ22Apr 6, 2026Updated last month
- Game UI Glitch Detection via Bug Understandingโ12Jul 31, 2021Updated 4 years ago
- Awesome-LLM-KV-Cache: A curated list of ๐Awesome LLM KV Cache Papers with Codes.โ435Mar 3, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"โ18Dec 15, 2020Updated 5 years ago
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.โ10Sep 6, 2022Updated 3 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)โ11Jun 12, 2020Updated 5 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.โ10May 16, 2024Updated 2 years ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)โ53Dec 17, 2024Updated last year
- ๐ฐ Must-read papers on KV Cache Compression (constantly updating ๐ค).โ705Apr 15, 2026Updated last month
- โ10Apr 29, 2023Updated 3 years ago
- AloePlayer: a cross-platform local media player.โ17Jan 24, 2026Updated 3 months ago
- โ24Jun 7, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- โ34Sep 19, 2025Updated 8 months ago
- โ10Dec 3, 2024Updated last year
- [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generationโ254Dec 16, 2024Updated last year
- Fast and memory-efficient exact attentionโ21Apr 10, 2026Updated last month
- KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024โ90Feb 27, 2025Updated last year
- Benchmarking Social Intelligence of Language Agents through Interactive Scenariosโ13Jan 4, 2025Updated last year
- PyTorch implementation of our ECCV 2022 paper "Rethinking Confidence Calibration for Failure Prediction"โ26Jun 10, 2023Updated 2 years ago
- โ20Jan 26, 2026Updated 3 months ago
- Extending context length of visual language modelsโ12Dec 18, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- โ41May 24, 2024Updated last year
- โ75Apr 13, 2025Updated last year
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023โ22Mar 12, 2026Updated 2 months ago
- [CIKM-21] Pytorch implementation of LiteGT: Efficient and Lightweight Graph Transformersโ12Nov 16, 2021Updated 4 years ago
- Self-Distribution BNNโ10Mar 8, 2022Updated 4 years ago
- โ18Jun 3, 2024Updated last year
- โ19Feb 18, 2025Updated last year