PrimeIntellect-ai/verifiers

Our library for RL environments + evals

☆4,389

Alternatives and similar repositories for verifiers

Users that are interested in verifiers are comparing it to the libraries listed below.

PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,698 · Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,081 · Updated this week
OpenPipe / ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,496 · Updated this week
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,463 · Apr 17, 2026 · Updated 3 months ago
PrimeIntellect-ai / community-environments
Lightly-reviewed collection of community environments
☆243 · Updated this week
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,571 · Updated this week
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆1,337 · Jul 4, 2026 · Updated 2 weeks ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,708 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,551 · Updated this week
thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆3,842 · Updated this week
KellerJordan / modded-nanogpt
NanoGPT (124M) in 90 seconds
☆5,518 · Jul 3, 2026 · Updated 2 weeks ago
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆1,835 · Updated this week
axon-rl / gem
A Gym for Agentic LLMs
☆502 · Jan 21, 2026 · Updated 5 months ago
huggingface / OpenEnv
An interface library for RL post training with environments.
☆2,436 · Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,892 · Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,252 · Updated this week
PufferAI / PufferLib
Puffing up reinforcement learning
☆6,161 · Updated this week
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆355 · Nov 2, 2025 · Updated 8 months ago
PrimeIntellect-ai / prime
Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…
☆223 · Updated this week
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆13,198 · Feb 27, 2026 · Updated 4 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆667 · Jan 29, 2026 · Updated 5 months ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,801 · Updated this week
radixark / miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,759 · Updated this week
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆427 · Updated this week
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753 · Apr 14, 2026 · Updated 3 months ago
xjdr-alt / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆3,434 · Nov 13, 2024 · Updated last year
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆696 · Updated this week
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,254 · Aug 26, 2025 · Updated 10 months ago
harbor-framework / harbor
Framework for evaluating and improving agents
☆3,320 · Updated this week
Noumena-Network / nmoe
MoE training for Me and You and maybe other people
☆394 · Mar 15, 2026 · Updated 4 months ago
sgl-project / sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,545 · Updated this week
areal-project / AReaL
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,575 · Updated this week
openai / harmony
Renderer for the harmony response format to be used with gpt-oss
☆4,461 · Apr 8, 2026 · Updated 3 months ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,755 · May 26, 2026 · Updated last month
pytorch / torchtitan
A PyTorch native platform for training generative AI models
☆5,545 · Updated this week
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆340 · Jan 12, 2026 · Updated 6 months ago
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆252 · Aug 25, 2025 · Updated 10 months ago
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆12,219 · Updated this week
TextArena / UnstableBaselines
☆120 · Apr 7, 2026 · Updated 3 months ago