TsinghuaC3I / Awesome-RL-Reasoning-RecipesLinks

Awesome RL Reasoning Recipes ("Triple R")

☆797

Alternatives and similar repositories for Awesome-RL-Reasoning-Recipes

Users that are interested in Awesome-RL-Reasoning-Recipes are comparing it to the libraries listed below

Sorting:

bruno686 / Awesome-RL-based-LLM-Reasoning
Awesome RL-based LLM Reasoning
☆601Updated last month
zzli2022 / Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
☆1,224Updated 2 months ago
0russwest0 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆757Updated last month
0russwest0 / Awesome-Agent-RL
☆349Updated 2 weeks ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆791Updated this week
Eclipsess / Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆574Updated last week
hemingkx / Awesome-Efficient-Reasoning
Paper list for Efficient Reasoning.
☆608Updated last week
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical report on Slow Thinking with LLM
☆726Updated 2 weeks ago
LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning
Latest Advances on Long Chain-of-Thought Reasoning
☆481Updated last month
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆660Updated 7 months ago
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆1,102Updated last week
thinkwee / AgentsMeetRL
An Awesome List of Agentic Model trained with Reinforcement Learning
☆370Updated last week
lqtrung1998 / mwp_ReFT
☆546Updated 7 months ago
TideDra / lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
☆812Updated 3 months ago
lsdefine / simple_GRPO
A very simple GRPO implement for reproducing r1-like LLM thinking.
☆1,290Updated 3 weeks ago
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆286Updated 2 weeks ago
dongguanting / ARPO
✨ Agentic Reinforced Policy Optimization
☆512Updated last week
xinzhel / LLM-Agent-Survey
Survey on LLM Agents (Published on CoLing 2025)
☆371Updated 3 months ago
ElliottYan / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆285Updated last month
PRIME-RL / TTRL
TTRL: Test-Time Reinforcement Learning
☆769Updated last week
dvlab-research / Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆376Updated 7 months ago
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆244Updated 2 weeks ago
GAIR-NLP / cognition-engineering
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
☆203Updated 4 months ago
bruno686 / Awesome-Agent-Training
Awesome Agent Training
☆213Updated 3 weeks ago
wjn1996 / Awesome-LLM-Reasoning-Openai-o1-Survey
The related works and background techniques about Openai o1
☆224Updated 7 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆310Updated last month
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,816Updated 7 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,518Updated 3 months ago
haoyangliu123 / awesome-deepseek-r1
A collection on the recent reproduction papers and projects on DeepSeek-R1
☆32Updated 6 months ago
GAIR-NLP / O1-Journey
O1 Replication Journey
☆1,998Updated 7 months ago