RUC-NLPIR/ARPO

[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)

☆1,092

Alternatives and similar repositories for ARPO

Users who are interested in ARPO are comparing it to the repositories listed below.

RUC-NLPIR / Tool-Star
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆403 · Apr 3, 2026 · Updated 3 months ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,145 · Jun 9, 2026 · Updated last month
ltzheng / SimpleTIR
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401 · Mar 30, 2026 · Updated 3 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,141 · Nov 13, 2025 · Updated 8 months ago
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,023 · Jul 15, 2026 · Updated last week
RUC-NLPIR / GISA
GISA: A Benchmark for General Information-Seeking Assistant
☆36 · Mar 20, 2026 · Updated 4 months ago
Agent-RL / ReCall
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,410 · May 16, 2025 · Updated last year
qiancheng0 / ToolRL
☆513 · Oct 16, 2025 · Updated 9 months ago
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,626 · Updated this week
HHHHHejia / Awesome-AgenticLLM-RL-Papers
☆1,843 · Jun 18, 2026 · Updated last month
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,721 · Updated this week
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆444 · Jul 11, 2025 · Updated last year
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆5,081 · Jul 15, 2026 · Updated last week
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753 · Apr 14, 2026 · Updated 3 months ago
JIA-Lab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆162 · May 29, 2025 · Updated last year
GAIR-NLP / ToRL
☆352 · May 24, 2025 · Updated last year
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,607 · Updated this week
AMAP-ML / Tree-GRPO
[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning
☆386 · Jan 26, 2026 · Updated 5 months ago
BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆1,085 · May 12, 2026 · Updated 2 months ago
RUC-NLPIR / DeepAgent
[WWW'26 Oral🔥] DeepAgent: A General Reasoning Agent with Scalable Toolsets
☆1,118 · Apr 13, 2026 · Updated 3 months ago
RUC-NLPIR / EnvScaler
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆177 · Feb 12, 2026 · Updated 5 months ago
WooooDyy / AgentGym-RL
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…
☆819 · Feb 15, 2026 · Updated 5 months ago
inclusionAI / ASearcher
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆602 · Nov 26, 2025 · Updated 7 months ago
alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,321 · Updated this week
EvolvingLMMs-Lab / multimodal-search-r1
[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…
☆469 · Apr 7, 2026 · Updated 3 months ago
Simple-Efficient / RL-Factory
Train your Agent model via our easy and efficient framework
☆1,773 · Dec 5, 2025 · Updated 7 months ago
OPPO-PersonalAI / Agent_Foundation_Models
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
☆579 · Sep 8, 2025 · Updated 10 months ago
areal-project / AReaL
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,589 · Updated this week
thinkwee / AgentsMeetRL
Awesome List for Agentic RL
☆1,716 · Updated this week
GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆782 · May 10, 2026 · Updated 2 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,844 · May 11, 2025 · Updated last year
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,841 · Jul 14, 2026 · Updated last week
dengmengjie / ToolScope
Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
☆31 · Nov 4, 2025 · Updated 8 months ago
AgentR1 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆1,565 · Updated this week
RUC-NLPIR / Search-o1
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
☆1,240 · Nov 17, 2025 · Updated 8 months ago
Simplified-Reasoning / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆460 · Mar 20, 2026 · Updated 4 months ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,019 · Jul 1, 2026 · Updated 3 weeks ago
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆2,468 · Nov 9, 2025 · Updated 8 months ago
Gen-Verse / Open-AgentRL
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆589 · Jun 12, 2026 · Updated last month