haoyangliu123 / awesome-deepseek-r1
A collection of recent reproduction papers and projects on DeepSeek-R1
☆27 · Updated last month
Alternatives and similar repositories for awesome-deepseek-r1:
Users interested in awesome-deepseek-r1 are comparing it to the libraries listed below.
- SOTA RL fine-tuning solution for advanced math reasoning of LLMs ☆91 · Updated this week
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆72 · Updated 7 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆119 · Updated 8 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆153 · Updated this week
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆165 · Updated 2 months ago
- ☆186 · Updated this week
- An index of algorithms for reinforcement learning from human feedback (RLHF) ☆93 · Updated 11 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision" ☆52 · Updated 4 months ago
- Awesome RL-based LLM Reasoning ☆341 · Updated last week
- Code for the paper "ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models" ☆179 · Updated last year
- [NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs" ☆104 · Updated last week
- ☆105 · Updated 6 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆138 · Updated 3 weeks ago
- This is my attempt to create a self-correcting LLM based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by g… ☆31 · Updated 3 months ago
- Direct preference optimization with f-divergences ☆13 · Updated 4 months ago
- ☆54 · Updated 5 months ago
- ☆117 · Updated 2 weeks ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment" ☆61 · Updated 3 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆124 · Updated 3 months ago
- ☆28 · Updated last month
- A research repo for experiments on reinforcement fine-tuning ☆36 · Updated last week
- ☆36 · Updated this week
- Source code for Self-Evaluation Guided MCTS for online DPO ☆299 · Updated 7 months ago
- Paper list for efficient reasoning ☆311 · Updated this week
- ☆61 · Updated 4 months ago
- ☆22 · Updated last week
- Paper list on inference/test-time scaling and computing ☆127 · Updated last week
- Repo for the paper "Free Process Rewards without Process Labels" ☆138 · Updated 2 weeks ago
- Accepted LLM papers at NeurIPS 2024 ☆34 · Updated 5 months ago
- This is an implementation of the paper "Improve Mathematical Reasoning in Language Models by Automated Process Supervision" from Google De… ☆25 · Updated 3 months ago