alibaba / ROCKLinks
A construction kit for reinforcement learning environment management.
☆326Updated this week
Alternatives and similar repositories for ROCK
Users that are interested in ROCK are comparing it to the libraries listed below
Sorting:
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆229Updated 5 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆552Updated 2 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆253Updated 5 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆189Updated 3 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆534Updated 5 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆353Updated 3 weeks ago
- ☆230Updated last month
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆520Updated this week
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆238Updated 8 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆468Updated 8 months ago
- ☆327Updated 4 months ago
- DeepConf: Deep Think with Confidence☆367Updated 4 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆220Updated 8 months ago
- Towards a Unified View of Large Language Model Post-Training☆200Updated 5 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆106Updated 6 months ago
- The evaluation benchmark on MCP servers☆238Updated 5 months ago
- A set of examples based on verl for end-to-end RL training recipes.☆152Updated last week
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆191Updated 7 months ago
- MiroTrain is an efficient and algorithm-first framework research agent.☆132Updated 5 months ago
- EvoCUA: Evolving Computer Use Agent☆248Updated 2 weeks ago
- ☆74Updated 8 months ago
- ☆103Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆261Updated 8 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆164Updated 4 months ago
- A Comprehensive Survey on Long Context Language Modeling☆226Updated 2 months ago
- ☆271Updated last week
- ☆520Updated last month
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆313Updated this week
- ☆281Updated last week