Eclipsess / Awesome-Efficient-Reasoning-LLMsLinks
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
โ464Updated last week
Alternatives and similar repositories for Awesome-Efficient-Reasoning-LLMs
Users that are interested in Awesome-Efficient-Reasoning-LLMs are comparing it to the libraries listed below
Sorting:
- Paper list for Efficient Reasoning.โ509Updated this week
- ๐ A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyondโ252Updated 2 weeks ago
- A series of technical report on Slow Thinking with LLMโ699Updated 2 weeks ago
- Awesome RL-based LLM Reasoningโ526Updated last month
- Latest Advances on Long Chain-of-Thought Reasoningโ390Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsโฆโ228Updated 2 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningโ222Updated last month
- โ242Updated last month
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)โ639Updated 5 months ago
- โ300Updated 3 weeks ago
- Awesome RL Reasoning Recipes ("Triple R")โ697Updated last week
- The related works and background techniques about Openai o1โ222Updated 5 months ago
- โ220Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ226Updated last year
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!โ54Updated 2 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learningโ573Updated 3 weeks ago
- [arXiv 2025] Efficient Reasoning Models: A Surveyโ181Updated last week
- โ203Updated 4 months ago
- โ222Updated this week
- Official Repository of "Learning to Reason under Off-Policy Guidance"โ240Updated 3 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.โ191Updated this week
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"โ369Updated 5 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringโ188Updated 2 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.โ241Updated 2 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMsโ156Updated 3 months ago
- โ782Updated last month
- Paper List of Inference/Test Time Scaling/Computingโ264Updated this week
- Survey on LLM Agents (Published on CoLing 2025)โ314Updated last month
- A Comprehensive Survey on Long Context Language Modelingโ152Updated 2 weeks ago
- โ540Updated 5 months ago