[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
β1,010Apr 13, 2026Updated last month
Alternatives and similar repositories for ARPO
Users that are interested in ARPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π§Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningβ385Apr 3, 2026Updated last month
- RAG methods, benchmarks, and toolkitsβ19Nov 28, 2024Updated last year
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoningβ387Mar 30, 2026Updated last month
- Repo for paper "ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability" (ACL 2026 Main)β176Apr 9, 2026Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-inβ¦β1,909Feb 27, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β67Aug 14, 2025Updated 9 months ago
- β51May 7, 2026Updated 2 weeks ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replayβ158May 29, 2025Updated 11 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,385May 16, 2025Updated last year
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,514Updated this week
- Some example codes for drawing figures in research paperβ35Mar 3, 2022Updated 4 years ago
- π Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]β1,221Nov 17, 2025Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,753Nov 13, 2025Updated 6 months ago
- A version of verl to support diverse tool useβ982Mar 2, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodalβ¦β446Apr 7, 2026Updated last month
- β219Feb 20, 2025Updated last year
- β40Apr 6, 2026Updated last month
- The demo, code and data of FollowRAGβ76Jun 30, 2025Updated 10 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.β572Sep 8, 2025Updated 8 months ago
- Democratizing Reinforcement Learning for LLMsβ5,548Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,810May 11, 2025Updated last year
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.β754May 10, 2026Updated 2 weeks ago
- β184Dec 5, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β1,774Jan 20, 2026Updated 4 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Useβ30Nov 4, 2025Updated 6 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRLβ4,950Apr 6, 2026Updated last month
- β28Jul 18, 2025Updated 10 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asyβ¦β9,523May 15, 2026Updated last week
- A series of technical report on Slow Thinking with LLMβ765Aug 13, 2025Updated 9 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,668Apr 14, 2026Updated last month
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"β84Dec 20, 2024Updated last year
- Official Repo for Open-Reasoner-Zeroβ2,091Jun 2, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Modelsβ3,165Updated this week
- Train your Agent model via our easy and efficient frameworkβ1,754Dec 5, 2025Updated 5 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searchesβ40Oct 9, 2025Updated 7 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scalingβ188Jul 23, 2025Updated 10 months ago
- β148Nov 17, 2025Updated 6 months ago
- SSRL: Self-Search Reinforcement Learningβ208Aug 20, 2025Updated 9 months ago
- Simple RL training for reasoningβ3,859Dec 23, 2025Updated 5 months ago