alibaba / ROCKLinks
A construction kit for reinforcement learning environment management.
☆292Updated this week
Alternatives and similar repositories for ROCK
Users that are interested in ROCK are comparing it to the libraries listed below
Sorting:
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆213Updated 4 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆529Updated last month
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆299Updated 3 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆187Updated 6 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆467Updated 7 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆520Updated 4 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆217Updated 7 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆160Updated last month
- ☆318Updated 3 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆238Updated 7 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆247Updated 5 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆348Updated 3 months ago
- ☆255Updated 5 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆473Updated this week
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆107Updated 5 months ago
- The evaluation benchmark on MCP servers☆234Updated 4 months ago
- ☆498Updated 3 weeks ago
- ☆261Updated this week
- MiroTrain is an efficient and algorithm-first framework for post-training large agentic models.☆114Updated 4 months ago
- Towards a Unified View of Large Language Model Post-Training☆199Updated 4 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆254Updated 8 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 7 months ago
- A Comprehensive Survey on Long Context Language Modeling☆215Updated last month
- DeepConf: Deep Think with Confidence☆344Updated 3 months ago
- ☆87Updated 4 months ago
- Scaling RL on advanced reasoning models☆656Updated 2 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆249Updated last month
- ☆326Updated 7 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆182Updated 5 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆849Updated 5 months ago