ZichenWen1 / EPICLinks
(NeurIPS 2025 π₯) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"
β40Updated last month
Alternatives and similar repositories for EPIC
Users that are interested in EPIC are comparing it to the libraries listed below
Sorting:
- Geometric-Mean Policy Optimizationβ96Updated last month
- dParallel: Learnable Parallel Decoding for dLLMsβ53Updated 2 months ago
- β23Updated 7 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concenβ¦β84Updated 6 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"β58Updated 3 weeks ago
- One-shot Entropy Minimizationβ187Updated 6 months ago
- Code release for VTW (AAAI 2025 Oral)β65Updated 2 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPOβ73Updated 2 months ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Modelsβ384Updated 3 weeks ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Thinkβ249Updated 3 months ago
- The code and data of We-Math 2.0.β163Updated 4 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*β109Updated 7 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Modelsβ56Updated 7 months ago
- repo for paper https://arxiv.org/abs/2504.13837β314Updated 3 weeks ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)β86Updated 3 months ago
- β99Updated 3 weeks ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Modelsβ146Updated 3 months ago
- Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"β72Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judgeβ40Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ89Updated 10 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMsβ62Updated 2 weeks ago
- Research works from Tencent AI Lab regarding self-evolving agentsβ78Updated 4 months ago
- β346Updated 5 months ago
- β62Updated 8 months ago
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"β73Updated 3 months ago
- [TMLR 2025] Efficient Reasoning Models: A Surveyβ290Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickersβ57Updated 10 months ago
- Official Repository of LatentSeekβ73Updated 7 months ago
- Code for Heimaβ58Updated 8 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.β148Updated 6 months ago