bruno686 / Awesome-RL-based-LLM-Reasoning

Awesome RL-based LLM Reasoning

☆634

Alternatives and similar repositories for Awesome-RL-based-LLM-Reasoning

Users who are interested in Awesome-RL-based-LLM-Reasoning are comparing it to the repositories listed below.

0russwest0 / Awesome-Agent-RL
☆406 Updated last month
hemingkx / Awesome-Efficient-Reasoning
Paper list for Efficient Reasoning.
☆669 Updated 2 weeks ago
Eclipsess / Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆627 Updated 2 weeks ago
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆301 Updated last week
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆1,191 Updated 2 weeks ago
LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning
Latest Advances on Long Chain-of-Thought Reasoning
☆516 Updated 2 months ago
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆1,615 Updated this week
0russwest0 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆818 Updated 2 months ago
zzli2022 / Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
☆1,242 Updated 3 months ago
thinkwee / AgentsMeetRL
An Awesome List of Agentic Models Trained with Reinforcement Learning
☆483 Updated 2 weeks ago
TideDra / lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
☆822 Updated 4 months ago
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical reports on Slow Thinking with LLMs
☆739 Updated last month
xhyumiracle / Awesome-AgenticLLM-RL-Papers
☆838 Updated last month
QingyangZhang / Label-Free-RLVR
☆269 Updated 3 months ago
RUC-NLPIR / ARPO
✨ Agentic Reinforced Policy Optimization
☆634 Updated 2 weeks ago
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆246 Updated last month
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,000 Updated this week
bruno686 / Awesome-Agent-Training
Awesome Agent Training
☆231 Updated last month
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆961 Updated this week
GAIR-NLP / cognition-engineering
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
☆206 Updated 5 months ago
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use
☆570 Updated this week
xinzhel / LLM-Agent-Survey
Survey on LLM Agents (Published at COLING 2025)
☆394 Updated 5 months ago
ElliottYan / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆330 Updated this week
OSU-NLP-Group / GUI-Agents-Paper-List
Building a comprehensive and handy list of papers for GUI agents
☆512 Updated 2 weeks ago
StarDewXXX / Awesome-Hybrid-CoT-Reasoning
☆53 Updated 4 months ago
yaotingwangofficial / Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
☆834 Updated last month
lqtrung1998 / mwp_ReFT
☆549 Updated 9 months ago
TsinghuaC3I / MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆276 Updated this week
jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆331 Updated 3 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆340 Updated 2 months ago