A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆368 · Feb 19, 2026 · Updated last week
Alternatives and similar repositories for PipelineRL
Users who are interested in PipelineRL are comparing it to the libraries listed below.
- Async RL Training at Scale ☆1,096 · Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆633 · Jan 29, 2026 · Updated last month
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models ☆68 · Apr 26, 2025 · Updated 10 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,090 · Aug 26, 2025 · Updated 6 months ago
- Minimalistic large language model 3D-parallelism training ☆2,579 · Feb 19, 2026 · Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,628 · Updated this week
- Official implementation of TBA for async LLM post-training. ☆29 · Nov 5, 2025 · Updated 3 months ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? ☆234 · Updated this week
- A bibliography and survey of the papers surrounding o1 ☆1,212 · Nov 16, 2024 · Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆331 · Apr 24, 2025 · Updated 10 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆51 · Jul 15, 2025 · Updated 7 months ago
- ☆1,104 · Jan 10, 2026 · Updated last month
- Scalable toolkit for efficient model reinforcement ☆1,353 · Updated this week
- Official Repo for Open-Reasoner-Zero ☆2,087 · Jun 2, 2025 · Updated 8 months ago
- ☆331 · May 31, 2025 · Updated 9 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆266 · Jul 8, 2025 · Updated 7 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,809 · Mar 18, 2025 · Updated 11 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆31 · Jun 5, 2025 · Updated 8 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent. ☆233 · Aug 27, 2025 · Updated 6 months ago
- Async pipelined version of Verl ☆124 · Apr 8, 2025 · Updated 10 months ago
- slime is an LLM post-training framework for RL Scaling. ☆4,381 · Updated this week
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆302 · Dec 16, 2025 · Updated 2 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆912 · Updated this week
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. ☆906 · Updated this week
- 🌎💪 BrowserGym, a Gym environment for web task automation ☆1,136 · Feb 10, 2026 · Updated 2 weeks ago
- Efficient Triton Kernels for LLM Training ☆6,162 · Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,352 · Jan 16, 2026 · Updated last month
- ☆38 · Aug 7, 2025 · Updated 6 months ago
- ☆123 · Feb 21, 2025 · Updated last year
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,522 · Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language ☆154 · Feb 3, 2025 · Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆201 · Jun 1, 2025 · Updated 9 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel ☆129 · Jun 24, 2025 · Updated 8 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆14 · Apr 30, 2025 · Updated 10 months ago
- Concise Reasoning via Reinforcement Learning ☆13 · Apr 16, 2025 · Updated 10 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆632 · Jul 29, 2025 · Updated 7 months ago
- Democratizing Reinforcement Learning for LLMs ☆5,167 · Updated this week
- AllenAI's post-training codebase ☆3,592 · Updated this week
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 · Jan 16, 2026 · Updated last month