StarDewXXX / O1-Pruner

Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

☆97

Alternatives and similar repositories for O1-Pruner

Users who are interested in O1-Pruner are comparing it to the repositories listed below.

GeniusHTX / TALE
☆136 Updated 2 months ago
Blueyee / Efficient-CoT-LRMs
Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!
☆70 Updated 7 months ago
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆189 Updated 4 months ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆257 Updated 6 months ago
Zanette-Labs / efficient-reasoning
☆67 Updated 7 months ago
THU-KEG / AdaptThink
☆165 Updated last month
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
☆277 Updated 2 weeks ago
EIT-NLP / Awesome-Latent-CoT
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆190 Updated last week
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆83 Updated 11 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆87 Updated 9 months ago
yafuly / TPO
Test-time preference optimization (ICML 2025).
☆169 Updated 6 months ago
zjunlp / LightThinker
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆123 Updated 7 months ago
ruixin31 / Spurious_Rewards
☆341 Updated 3 months ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆80 Updated 5 months ago
StarDewXXX / AdaR1
The official repository of the NeurIPS'25 paper "Ada-R1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆20 Updated 2 weeks ago
AlphaLab-USTC / LRM-plans-CoT
[NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆25 Updated 4 months ago
BaohaoLiao / RSD
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆51 Updated 6 months ago
iie-ycx / DEER
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
☆177 Updated 4 months ago
ltzheng / SimpleTIR
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆318 Updated 2 months ago
LightChen233 / reasoning-boundary
☆69 Updated 5 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆131 Updated 8 months ago
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆315 Updated last month
GAIR-NLP / LIMR
☆212 Updated 9 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆83 Updated 7 months ago
ChnQ / MI-Peaks
☆55 Updated 4 months ago
tianyi-lab / MiP-Overthinking
[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
☆35 Updated 5 months ago
sail-sg / CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
☆132 Updated 8 months ago
ssmisya / PRMBench
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆84 Updated 9 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆180 Updated 4 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆87 Updated this week