[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
β773Feb 28, 2026Updated 3 months ago
Alternatives and similar repositories for Awesome-Efficient-Reasoning-LLMs
Users that are interested in Awesome-Efficient-Reasoning-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Paper list for Efficient Reasoning.β889May 29, 2026Updated 3 weeks ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ355Jan 22, 2026Updated 4 months ago
- Latest Advances on System-2 Reasoningβ1,351Jun 8, 2025Updated last year
- Latest Advances on Long Chain-of-Thought Reasoningβ637Jul 18, 2025Updated 11 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruningβ99Feb 21, 2025Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [TMLR 2025] Efficient Reasoning Models: A Surveyβ313Mar 9, 2026Updated 3 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,637Apr 20, 2026Updated last month
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMsβ223Nov 30, 2025Updated 6 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningβ262May 14, 2025Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compressionβ164Apr 7, 2026Updated 2 months ago
- β75Apr 13, 2025Updated last year
- β150Sep 12, 2025Updated 9 months ago
- β22Oct 3, 2024Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β339May 13, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,261Aug 27, 2025Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ92Feb 14, 2025Updated last year
- KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024β90Feb 27, 2025Updated last year
- A Survey of Reinforcement Learning for Large Reasoning Modelsβ2,466Nov 9, 2025Updated 7 months ago
- A series of technical report on Slow Thinking with LLMβ765Aug 13, 2025Updated 10 months ago
- β80Jun 8, 2026Updated last week
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-basβ¦β1,421May 11, 2026Updated last month
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,969Updated this week
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factualityβ47May 19, 2026Updated last month
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official code repository for Sketch-of-Thought (SoT)β138May 8, 2025Updated last year
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,441Apr 6, 2026Updated 2 months ago
- β71Jun 18, 2025Updated last year
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Modelsβ125Oct 16, 2025Updated 8 months ago
- Simple RL training for reasoningβ3,864Dec 23, 2025Updated 5 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asyβ¦β9,652Jun 9, 2026Updated last week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β260Mar 7, 2026Updated 3 months ago
- β225Mar 26, 2025Updated last year
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,824May 11, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Repo for Open-Reasoner-Zeroβ2,097Jun 2, 2025Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"β452Mar 20, 2026Updated 2 months ago
- Paper List of Inference/Test Time Scaling/Computingβ388May 31, 2026Updated 2 weeks ago
- A Sober Look at Language Model Reasoningβ92Nov 18, 2025Updated 7 months ago
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Surveyβ1,005May 22, 2026Updated 3 weeks ago
- [COLM 2025] LIMO: Less is More for Reasoningβ1,077Jul 30, 2025Updated 10 months ago
- This is the official implementation of the paper "SΒ²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"β73Apr 22, 2025Updated last year