Build RL environments for LLM training
☆758Mar 21, 2026Updated this week
Alternatives and similar repositories for Gym
Users that are interested in Gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scalable toolkit for efficient model reinforcement☆1,447Updated this week
- Open-source library for scalable, reproducible evaluation of AI models and benchmarks.☆240Updated this week
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 3 months ago
- ☆14Apr 16, 2025Updated 11 months ago
- An interface library for RL post training with environments.☆1,288Updated this week
- Scalable data pre processing and curation toolkit for LLMs☆1,460Updated this week
- 微聚,专业的数据标注,采集平台☆13Jun 19, 2018Updated 7 years ago
- Post-training with Tinker☆2,971Updated this week
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆866Updated this week
- ☆25Mar 7, 2026Updated 2 weeks ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆7,745Updated this week
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆565Mar 11, 2026Updated last week
- Online Preference Alignment for Language Models via Count-based Exploration☆17Jan 14, 2025Updated last year
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆925Feb 28, 2026Updated 3 weeks ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,050Updated this week
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- A Lightweight LLM Post-Training Library☆2,196Updated this week
- AI coding models, agents, CLIs, IDEs, AI app builders, open source tooling, benchmarks☆43Feb 24, 2026Updated last month
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,097Updated this week
- An open source MCP proxy.☆17Jan 3, 2025Updated last year
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆4,855Updated this week
- Harness for running and evaluating AI agents against RL environments☆132Mar 6, 2026Updated 2 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆424Jul 11, 2025Updated 8 months ago
- ☆14Apr 7, 2025Updated 11 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,630Oct 23, 2025Updated 5 months ago
- ☆27Dec 13, 2024Updated last year
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- Synkra AIOS: AI-Orchestrated System for Full Stack Development - Core Framework v4.0☆2,378Mar 11, 2026Updated last week
- cuTile is a programming model for writing parallel kernels for NVIDIA GPUs☆1,975Updated this week
- ☆43Jan 27, 2026Updated last month
- Training library for Megatron-based models with bidirectional Hugging Face conversion capability☆509Updated this week
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"☆65Dec 29, 2025Updated 2 months ago
- A Gym for Agentic LLMs☆467Jan 21, 2026Updated 2 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆115Feb 2, 2026Updated last month
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 8 months ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆15Dec 11, 2023Updated 2 years ago
- ☆87Aug 16, 2025Updated 7 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 5 months ago