NVIDIA-NeMo/ProRL-Agent-Server

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA-NeMo/ProRL-Agent-Server)

NVIDIA-NeMo / ProRL-Agent-Server

Agentic RL on Any Harness at Scale

☆714

Alternatives and similar repositories for ProRL-Agent-Server

Users that are interested in ProRL-Agent-Server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,102Updated this week
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,679Updated this week
NVIDIA-NeMo / labs-molt
View on GitHub
☆716Updated this week
radixark / miles
View on GitHub
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,805Updated this week
NVIDIA-NeMo / RL
View on GitHub
Scalable toolkit for efficient model reinforcement
☆1,855Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,026Jul 15, 2026Updated 2 weeks ago
hamishivi / tmax
View on GitHub
Training terminal-agents
☆253Updated this week
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,740Updated this week
Infini-AI-Lab / astraflow
View on GitHub
Dataflow-Oriented Reinforcement Learning for (Multi-)Agentic LLMs
☆97Updated this week
Gen-Verse / OpenClaw-RL
View on GitHub
OpenClaw-RL: Train any agent simply by talking
☆5,611May 23, 2026Updated 2 months ago
harbor-framework / harbor
View on GitHub
Framework for evaluating and improving agents
☆3,611Updated this week
areal-project / AReaL
View on GitHub
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,612Updated this week
axon-rl / gem
View on GitHub
A Gym for Agentic LLMs
☆503Jan 21, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MoonshotAI / checkpoint-engine
View on GitHub
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆984Jul 4, 2026Updated 3 weeks ago
complex-reasoning / RPG
View on GitHub
[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)
☆76Jun 29, 2026Updated last month
open-tinker / OpenTinker
View on GitHub
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆677Mar 21, 2026Updated 4 months ago
vllm-project / vime
View on GitHub
An LLM post-training framework with vLLM for RL Scaling
☆390Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,699Updated this week
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,759Updated this week
NVIDIA-NeMo / Automodel
View on GitHub
🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
☆771Updated this week
ByteDance-Seed / EdgeBench
View on GitHub
EdgeBench: Unveiling scaling laws of learning from real-world environments
☆393Jul 17, 2026Updated last week
NVlabs / ToolOrchestra
View on GitHub
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
☆753Mar 25, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lasgroup / SDPO
View on GitHub
Reinforcement Learning via Self-Distillation (SDPO)
☆1,027Jul 1, 2026Updated 3 weeks ago
alibaba / ROLL
View on GitHub
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,333Updated this week
open-thoughts / OpenThoughts-Agent
View on GitHub
Data recipes and robust infrastructure for training AI agents
☆271Updated this week
verl-project / uni-agent
View on GitHub
Uni-Agent is a framework for training long-horizon agents.
☆449Updated this week
LMIS-ORG / slime-agentic
View on GitHub
A project implementing various agentic RL based on the Slime post-training framework
☆509Apr 11, 2026Updated 3 months ago
ChenxinAn-fdu / POLARIS
View on GitHub
Scaling RL on advanced reasoning models
☆692Oct 20, 2025Updated 9 months ago
redai-infra / Relax
View on GitHub
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
☆546Updated this week
NVIDIA-NeMo / Gym
View on GitHub
Evaluate and improve models and agents using environments
☆1,072Updated this week
Ascend / TransferQueue
View on GitHub
An asynchronous streaming data management module for efficient post-training.
☆123Jul 12, 2026Updated 2 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
RLsys-Foundation / APRIL
View on GitHub
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…
☆60Oct 11, 2025Updated 9 months ago
stepfun-ai / SteptronOss
View on GitHub
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…
☆579May 18, 2026Updated 2 months ago
R2E-Gym / R2E-Gym
View on GitHub
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆312Jul 13, 2025Updated last year
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,855Jul 14, 2026Updated 2 weeks ago
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,757Updated this week
PrimeIntellect-ai / renderers
View on GitHub
Programmable chat templates for LLM training and inference.
☆135Updated this week
facebookresearch / swe-rl
View on GitHub
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆712Mar 16, 2025Updated last year