wzhwzhwzh0921 / Awesome_LRM_with_EntropyLinks
Introduction about AWESOME_ENTROPY+LRM_PAPERS
☆29Updated last month
Alternatives and similar repositories for Awesome_LRM_with_Entropy
Users that are interested in Awesome_LRM_with_Entropy are comparing it to the libraries listed below
Sorting:
- ☆112Updated 4 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆248Updated 3 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆404Updated 3 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆128Updated this week
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆12Updated last year
- ☆303Updated 6 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆601Updated 6 months ago
- ☆28Updated 2 weeks ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆117Updated 7 months ago
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆837Updated 8 months ago
- ☆59Updated last year
- Official Repository of "Learning what reinforcement learning can't"☆79Updated last month
- ☆88Updated last year
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,285Updated last month
- Extrapolating RLVR to General Domains without Verifiers☆191Updated 5 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆106Updated last month
- ☆57Updated 7 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆312Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆258Updated 5 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Updated 9 months ago
- 关于LLM和Multimodal LLM的paper list☆55Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆411Updated 6 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…☆69Updated 6 months ago
- ☆489Updated 3 months ago
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models☆51Updated last year
- MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka☆322Updated 7 months ago
- llm & rl☆268Updated 3 months ago
- Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"☆49Updated 4 months ago
- Agentic MLLMs☆159Updated 3 months ago
- This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages …☆751Updated 4 months ago