Async RL Training at Scale
☆1,156Mar 18, 2026Updated this week
Alternatives and similar repositories for prime-rl
Users that are interested in prime-rl are comparing it to the libraries listed below
Sorting:
- Our library for RL environments + evals☆3,918Updated this week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆914Updated this week
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆175Updated this week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆380Mar 10, 2026Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,699Updated this week
- Lightly-reviewed collection of community environments☆219Mar 12, 2026Updated last week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,368Mar 15, 2026Updated last week
- slime is an LLM post-training framework for RL Scaling.☆4,799Updated this week
- Scalable toolkit for efficient model reinforcement☆1,418Updated this week
- Solidity contracts for the decentralized Prime Network protocol☆26Jul 6, 2025Updated 8 months ago
- AllenAI's post-training codebase☆3,629Updated this week
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆851Nov 16, 2025Updated 4 months ago
- Democratizing Reinforcement Learning for LLMs☆5,259Updated this week
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆137Nov 10, 2025Updated 4 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆148Sep 12, 2025Updated 6 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆639Jan 29, 2026Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,821Mar 18, 2025Updated last year
- Train your own SOTA deductive reasoning model☆108Mar 6, 2025Updated last year
- Build your own visual reasoning model☆419Jan 13, 2026Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆134Feb 21, 2026Updated last month
- Minimalistic large language model 3D-parallelism training☆2,617Feb 19, 2026Updated last month
- A Gym for Agentic LLMs☆467Jan 21, 2026Updated 2 months ago
- MoE training for Me and You and maybe other people☆375Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,919Updated this week
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆222Nov 27, 2025Updated 3 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆925Feb 28, 2026Updated 3 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,116Aug 26, 2025Updated 6 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,553Mar 15, 2026Updated last week
- ☆1,113Jan 10, 2026Updated 2 months ago
- An interface library for RL post training with environments.☆1,250Mar 14, 2026Updated last week
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆52Apr 14, 2025Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- Automatic evals for LLMs☆583Feb 24, 2026Updated 3 weeks ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- A PyTorch native platform for training generative AI models☆5,162Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,339Mar 9, 2026Updated last week
- ☆137Mar 20, 2025Updated last year
- Tile primitives for speedy kernels☆3,232Updated this week
- Open-source framework for the research and development of foundation models.☆803Updated this week