alibaba / ROCKLinks
A construction kit for reinforcement learning environment management.
☆226Updated this week
Alternatives and similar repositories for ROCK
Users that are interested in ROCK are comparing it to the libraries listed below
Sorting:
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆504Updated this week
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆456Updated 6 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆179Updated 3 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆497Updated 2 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆235Updated 6 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆244Updated 3 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆187Updated 4 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆207Updated 6 months ago
- ☆300Updated 6 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆145Updated last week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆283Updated last month
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆418Updated last week
- ☆819Updated 5 months ago
- A Comprehensive Survey on Long Context Language Modeling☆204Updated last week
- ☆279Updated 2 months ago
- ☆235Updated 3 months ago
- Reproducing R1 for Code with Reliable Rewards☆272Updated 6 months ago
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆108Updated 3 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆330Updated 2 months ago
- Async pipelined version of Verl☆125Updated 7 months ago
- Towards a Unified View of Large Language Model Post-Training☆187Updated 2 months ago
- ☆74Updated 5 months ago
- ☆180Updated 2 weeks ago
- DeepConf: Deep Think with Confidence☆320Updated 2 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆324Updated 7 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆252Updated 6 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆848Updated last week
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆208Updated 2 months ago
- ☆91Updated 6 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆803Updated 4 months ago