wzhwzhwzh0921 / Awesome_LRM_with_EntropyLinks
Introduction about AWESOME_ENTROPY+LRM_PAPERS
☆25Updated last week
Alternatives and similar repositories for Awesome_LRM_with_Entropy
Users that are interested in Awesome_LRM_with_Entropy are comparing it to the libraries listed below
Sorting:
- ☆111Updated 3 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆228Updated 2 months ago
- 关于LLM和Multimodal LLM的paper list☆50Updated last week
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆103Updated 3 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,214Updated 2 months ago
- ☆294Updated 5 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆124Updated 2 weeks ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆71Updated 8 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆254Updated 4 months ago
- A comprehensive collection of process reward models.☆130Updated 2 months ago
- ☆59Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆392Updated 2 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆321Updated 2 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆148Updated 2 months ago
- ☆1,039Updated last month
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆53Updated last week
- Latest Advances on Long Chain-of-Thought Reasoning☆580Updated 5 months ago
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…☆17Updated last year
- ☆41Updated 9 months ago
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆24Updated 2 months ago
- Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models☆49Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆118Updated 6 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆29Updated 2 weeks ago
- Membenchmark repository☆41Updated last month
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆834Updated 7 months ago
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆764Updated 3 months ago
- ☆27Updated last year
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆12Updated last year
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆297Updated 2 months ago
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆108Updated last week