An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.
☆284Feb 3, 2026Updated last month
Alternatives and similar repositories for Open-AgentRL
Users that are interested in Open-AgentRL are comparing it to the libraries listed below
Sorting:
- A construction kit for reinforcement learning environment management.☆352Updated this week
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆59Feb 22, 2026Updated last week
- ☆44Nov 1, 2025Updated 4 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- Latent Collaboration in Multi-Agent Systems☆775Feb 9, 2026Updated 3 weeks ago
- ☆12Feb 26, 2025Updated last year
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated 9 months ago
- RePo: Language Models with Context Re-Positioning☆70Dec 24, 2025Updated 2 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆75Jan 23, 2026Updated last month
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆14Aug 25, 2023Updated 2 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- LLM手撕代码合集☆20Mar 25, 2025Updated 11 months ago
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆543Feb 24, 2026Updated last week
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated last week
- ☆13Jun 22, 2024Updated last year
- ☆44Feb 13, 2026Updated 2 weeks ago
- ☆12Jul 31, 2025Updated 7 months ago
- ☆136Jan 26, 2026Updated last month
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆53Nov 4, 2025Updated 3 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆48Dec 25, 2025Updated 2 months ago
- Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning☆29Sep 12, 2025Updated 5 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆608Feb 15, 2026Updated 2 weeks ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆16Apr 23, 2024Updated last year
- ☆27Jan 9, 2026Updated last month
- Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization☆396Feb 17, 2026Updated last week
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆435Jan 28, 2026Updated last month
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆45Feb 7, 2026Updated 3 weeks ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆221Nov 27, 2025Updated 3 months ago
- ☆43Jan 30, 2026Updated last month
- ☆223Jun 2, 2025Updated 9 months ago
- [NeurIPS 2025 Spotlight] LLM post-training suite for long-CoT reasoning, PRM, and code generation — featuring ReasonFlux, ReasonFlux-PRM,…☆521Sep 27, 2025Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- 🚀 轻量视频🎥 大模型🤖☆21Apr 27, 2025Updated 10 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆26Oct 14, 2025Updated 4 months ago
- ☆28Oct 2, 2025Updated 5 months ago