ZichenWen1 / EPIC
(NeurIPS 2025) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"
☆40 · Updated 2 months ago
Alternatives and similar repositories for EPIC
Users who are interested in EPIC are comparing it to the repositories listed below.
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆65 · Updated 3 weeks ago
- [ICLR 2026] Geometric-Mean Policy Optimization ☆99 · Updated 2 weeks ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆57 · Updated 8 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs ☆58 · Updated 2 weeks ago
- Collection of token-level model compression resources. ☆190 · Updated 5 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆129 · Updated 6 months ago
- Code release for VTW (AAAI 2025 Oral) ☆64 · Updated 3 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆109 · Updated 8 months ago
- GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆88 · Updated 4 months ago
- One-shot Entropy Minimization ☆188 · Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆50 · Updated this week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆91 · Updated 11 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆180 · Updated 8 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆85 · Updated 7 months ago
- ☆64 · Updated 2 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 · Updated last week
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆46 · Updated last year
- [TMLR 2025] Efficient Reasoning Models: A Survey ☆298 · Updated last week
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning ☆96 · Updated 4 months ago
- [ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models ☆423 · Updated last week
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" ☆24 · Updated 11 months ago
- ☆110 · Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO ☆78 · Updated 3 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆104 · Updated 4 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think ☆251 · Updated 4 months ago
- Official Repository of LatentSeek ☆76 · Updated 8 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models ☆149 · Updated 4 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model ☆37 · Updated last year
- ☆145 · Updated 4 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆89 · Updated 8 months ago