0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

☆1,201

Alternatives and similar repositories for Agent-R1

Users who are interested in Agent-R1 are comparing it to the repositories listed below.

0russwest0 / Awesome-Agent-RL
☆490 Updated 3 months ago
RUCAIBox / R1-Searcher
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆683 Updated 6 months ago
Agent-RL / ReCall
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,314 Updated 8 months ago
GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆694 Updated 3 months ago
RUC-NLPIR / ARPO
The official code of ARPO & AEPO
☆880 Updated last week
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆1,501 Updated last week
thinkwee / AgentsMeetRL
Awesome List for Agentic RL
☆760 Updated last month
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical reports on Slow Thinking with LLMs
☆759 Updated 5 months ago
qiancheng0 / ToolRL
☆427 Updated 3 months ago
xhyumiracle / Awesome-AgenticLLM-RL-Papers
☆1,513 Updated 2 weeks ago
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use
☆860 Updated last month
lqtrung1998 / mwp_ReFT
☆554 Updated last year
BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆881 Updated 6 months ago
LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning
Latest Advances on Long Chain-of-Thought Reasoning
☆605 Updated 6 months ago
Qihoo360 / Light-R1
☆761 Updated last month
Eclipsess / Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆731 Updated 3 months ago
CharlesQ9 / Self-Evolving-Agents
☆849 Updated 3 months ago
xinzhel / LLM-Agent-Survey
Survey on LLM Agents (Published at COLING 2025)
☆470 Updated 4 months ago
RUC-NLPIR / Search-o1
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
☆1,164 Updated 2 months ago
zzli2022 / Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
☆1,320 Updated 8 months ago
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆2,316 Updated 2 months ago
agentscope-ai / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆510 Updated last week
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆691 Updated last year
PRIME-RL / TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆972 Updated 4 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,715 Updated 8 months ago
quchangle1 / LLM-Tool-Survey
This is the repository for the Tool Learning survey.
☆478 Updated 5 months ago
RUC-NLPIR / Tool-Star
🔧 Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆314 Updated last month
OPPO-PersonalAI / Agent_Foundation_Models
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
☆534 Updated 5 months ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆807 Updated last year
GAIR-NLP / cognition-engineering
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
☆209 Updated 9 months ago