wln20 / CSKVLinks
[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Updated last year
Alternatives and similar repositories for CSKV
Users that are interested in CSKV are comparing it to the libraries listed below
Sorting:
- A Text2SQL benchmark for evaluation of Large Language Models☆36Updated this week
- ☆16Updated 3 months ago
- A comprehensive and efficient long-context model evaluation framework☆26Updated last week
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 8 months ago
- ☆14Updated 10 months ago
- ☆14Updated 11 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆18Updated last month
- ☆17Updated last year
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆26Updated 3 months ago
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆12Updated 2 months ago
- ☆13Updated 8 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 3 weeks ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆22Updated 4 months ago
- ☆16Updated last year
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Updated last year
- ☆14Updated this week
- ☆25Updated 8 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆27Updated last month
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…☆16Updated 2 weeks ago
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated last month
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆16Updated last year
- ☆24Updated 2 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆10Updated 8 months ago
- ☆31Updated 3 months ago
- ☆11Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 10 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆10Updated 3 weeks ago
- ☆29Updated last month
- Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training…☆42Updated last year
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆17Updated 8 months ago