alibaba / ROCKLinks
A construction kit for reinforcement learning environment management.
☆326Updated this week
Alternatives and similar repositories for ROCK
Users that are interested in ROCK are comparing it to the libraries listed below
Sorting:
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆229Updated 5 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆552Updated 2 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆468Updated 8 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆534Updated 5 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆189Updated 3 weeks ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆238Updated 8 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆520Updated this week
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆253Updated 5 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆296Updated 3 weeks ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆191Updated 7 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆220Updated 8 months ago
- MiroTrain is an efficient and algorithm-first framework research agent.☆132Updated 5 months ago
- Towards a Unified View of Large Language Model Post-Training☆200Updated 5 months ago
- ☆275Updated 5 months ago
- ☆520Updated last month
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆353Updated 3 weeks ago
- A set of examples based on verl for end-to-end RL training recipes.☆152Updated last week
- ☆230Updated last month
- ☆327Updated 4 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Updated last week
- A Comprehensive Survey on Long Context Language Modeling☆226Updated 2 months ago
- ☆82Updated 10 months ago
- ☆449Updated 5 months ago
- ☆281Updated last week
- qqr is an RL training framework for open-ended agents.☆205Updated 2 weeks ago
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆204Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆881Updated 6 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆106Updated 6 months ago
- ☆78Updated 7 months ago