A construction kit for reinforcement learning environment management.
☆384 Mar 19, 2026 Updated this week
Alternatives and similar repositories for ROCK
Users that are interested in ROCK are comparing it to the libraries listed below.
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models ☆2,989 Updated this week
- A Lightweight LLM Inference Performance Simulator ☆67 Updated this week
- RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings ☆381 Feb 27, 2026 Updated 3 weeks ago
- Best practice for training LLaMA models in Megatron-LM ☆664 Jan 2, 2024 Updated 2 years ago
- Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans. ☆201 Updated this week
- ☆956 Dec 11, 2025 Updated 3 months ago
- General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models. ☆43 Updated this week
- MemVerse: Multimodal Memory for Lifelong Learning Agents ☆135 Updated this week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆1,070 Updated this week
- ☆14 Oct 11, 2023 Updated 2 years ago
- ☆32 Sep 19, 2025 Updated 6 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents ☆27 Feb 17, 2026 Updated last month
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling ☆36 Feb 25, 2026 Updated 3 weeks ago
- ☆18 Apr 18, 2025 Updated 11 months ago
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023. ☆15 May 27, 2023 Updated 2 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra… ☆54 Oct 11, 2025 Updated 5 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution ☆103 Sep 24, 2025 Updated 5 months ago
- A Gym for Agentic LLMs ☆467 Jan 21, 2026 Updated 2 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆645 Updated this week
- REverse-Engineered Reasoning for Open-Ended Generation ☆94 Sep 10, 2025 Updated 6 months ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation ☆96 Mar 5, 2026 Updated 2 weeks ago
- Kinetics: Rethinking Test-Time Scaling Laws ☆86 Jul 11, 2025 Updated 8 months ago
- ☆11 Feb 5, 2024 Updated 2 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆13 Jul 27, 2025 Updated 7 months ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients ☆10 May 27, 2021 Updated 4 years ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning ☆44 May 10, 2023 Updated 2 years ago
- Website code for Boximator: Generating Rich and Dynamic Motions for Video Synthesis ☆17 Feb 19, 2024 Updated 2 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ☆23 Mar 18, 2025 Updated last year
- ☆14 Aug 21, 2025 Updated 7 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆312 Oct 13, 2025 Updated 5 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 Jan 16, 2026 Updated 2 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease ☆1,145 Updated this week
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆4,855 Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,097 Updated this week
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆14 Mar 17, 2025 Updated last year
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent ☆17 Sep 8, 2022 Updated 3 years ago
- ☆94 May 16, 2025 Updated 10 months ago
- Code and updates for the ScoreRS project. ☆42 Sep 19, 2025 Updated 6 months ago
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. ☆1,001 Updated this week