alibaba/ROCK

A construction kit for reinforcement learning environment management.

☆470

Alternatives and similar repositories for ROCK

Users who are interested in ROCK are comparing it to the libraries listed below.

alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,316 · Updated this week
alibaba / RecIS
A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.
☆350 · Feb 9, 2026 · Updated 5 months ago
alibaba / terminal-bench-pro
☆119 · Apr 1, 2026 · Updated 3 months ago
axon-rl / gem
A Gym for Agentic LLMs
☆502 · Jan 21, 2026 · Updated 6 months ago
areal-project / AReaL
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,586 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,594 · Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,086 · Updated this week
alibaba / rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
☆1,281 · Updated this week
alibaba / paimon-cpp
Paimon-cpp is a high-performance C++ implementation of Apache Paimon.
☆126 · Updated this week
radixark / miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,767 · Updated this week
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587 · Updated this week
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,022 · Jul 15, 2026 · Updated last week
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆975 · Jul 4, 2026 · Updated 2 weeks ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,715 · Updated this week
agentscope-ai / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆672 · Updated this week
ByteDance-Seed / VeOmni
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
☆2,102 · Updated this week
redai-infra / Relax
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
☆526 · Updated this week
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,834 · Jul 14, 2026 · Updated last week
NVIDIA-NeMo / ProRL-Agent-Server
Agentic RL on Any Harness at Scale
☆699 · Jul 15, 2026 · Updated last week
Gen-Verse / Open-AgentRL
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆589 · Jun 12, 2026 · Updated last month
THUDM / AgentRL
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆322 · Jan 17, 2026 · Updated 6 months ago
harbor-framework / harbor
Framework for evaluating and improving agents
☆3,348 · Updated this week
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆666 · Jan 2, 2024 · Updated 2 years ago
langfengQ / verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,143 · Jun 9, 2026 · Updated last month
rlops / rlix
Run more RL experiments. Wait less for GPUs.
☆290 · Updated this week
stepfun-ai / SteptronOss
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…
☆577 · May 18, 2026 · Updated 2 months ago
fzyzcjy / torch_memory_saver
Allow torch tensor memory to be released and resumed later
☆260 · Updated this week
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753 · Apr 14, 2026 · Updated 3 months ago
verl-project / uni-agent
A unified framework for building, running, and training general agents at scale.
☆433 · Updated this week
alibaba / PyFlightProfiler
PyFlightProfiler: A diagnostic toolbox for Python applications that provides non-intrusive, low-overhead capabilities for online analysis…
☆43 · May 18, 2026 · Updated 2 months ago
bytedance / SandboxFusion
☆1,042 · Jul 14, 2026 · Updated last week
open-tinker / OpenTinker
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676 · Mar 21, 2026 · Updated 4 months ago
inclusionAI / AWorld
Search, understand, reproduce, and improve an idea with ease
☆1,211 · Updated this week
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,096 · Jun 2, 2025 · Updated last year
inclusionAI / AEnvironment
Standardized environment infrastructure for Agentic AI development.
☆313 · Jul 10, 2026 · Updated last week
Gen-Verse / OpenClaw-RL
OpenClaw-RL: Train any agent simply by talking
☆5,596 · May 23, 2026 · Updated 2 months ago
RLsys-Foundation / APRIL
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…
☆60 · Oct 11, 2025 · Updated 9 months ago
Qwen-Applications / OpenRS
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
☆19 · Mar 5, 2026 · Updated 4 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,133 · Nov 13, 2025 · Updated 8 months ago