krafton-ai / lexicoLinks
KV cache compression via sparse coding
☆10Updated last month
Alternatives and similar repositories for lexico
Users that are interested in lexico are comparing it to the libraries listed below
Sorting:
- Official Implementation of FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation☆20Updated last month
- ☆10Updated last week
- This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration☆14Updated 3 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 3 months ago
- ☆29Updated 2 months ago
- ☆12Updated 5 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Updated 2 months ago
- ☆42Updated 7 months ago
- ☆14Updated last month
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 3 months ago
- The code implementation of Symbolic-MoE☆32Updated 3 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆16Updated last month
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆19Updated 4 months ago
- Code for Heima☆46Updated 2 months ago
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆39Updated 2 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆36Updated last week
- Unsupervised GRPO☆33Updated 2 weeks ago
- ☆13Updated last month
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆12Updated 6 months ago
- The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"☆36Updated this week
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 8 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆16Updated last month
- Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆29Updated last month
- ☆37Updated 8 months ago
- ☆16Updated 5 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆8Updated 4 months ago
- LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆18Updated 2 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆47Updated 3 weeks ago
- ☆17Updated 5 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆15Updated last week