Evaluate and improve models and agents using environments
☆868May 1, 2026Updated this week
Alternatives and similar repositories for Gym
Users that are interested in Gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 4 months ago
- ☆14Apr 16, 2025Updated last year
- Scalable data pre processing and curation toolkit for LLMs☆1,550Updated this week
- ☆30Mar 26, 2026Updated last month
- Post-training with Tinker☆3,158Apr 26, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An interface library for RL post training with environments.☆1,806Updated this week
- Hand-Rolled GPU communications library☆92Nov 25, 2025Updated 5 months ago
- MCP server for Youtube☆19Mar 15, 2025Updated last year
- 🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.☆1,746Updated this week
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Apr 20, 2026Updated last week
- ☆33Jan 26, 2026Updated 3 months ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆8,132Updated this week
- ☆26Mar 7, 2026Updated last month
- ☆23Jan 15, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An experimental attempt to make a CLI for supply-chain modeling for Helpful Engineering's Project Data☆10Oct 29, 2023Updated 2 years ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆109Feb 28, 2026Updated 2 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,046Updated this week
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆25Aug 8, 2024Updated last year
- PyTorch-native post-training at scale☆675Updated this week
- 🚀 SuperMCP - Create multiple isolated MCP servers using a single connector. Build powerful Model Context Protocol integrations for datab…☆57Jan 26, 2026Updated 3 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,790Updated this week
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆616Updated this week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆953Feb 28, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Training library for Megatron-based models with bidirectional Hugging Face conversion capability☆599Updated this week
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆60Jan 9, 2024Updated 2 years ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,382Updated this week
- A Lightweight LLM Post-Training Library☆2,249Updated this week
- 🌺 Petal - Flask, for gRPC services.☆12Apr 21, 2019Updated 7 years ago
- The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.☆5,121Updated this week
- Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search☆17Jul 25, 2024Updated last year
- AI coding models, agents, CLIs, IDEs, AI app builders, open source tooling, benchmarks☆52Apr 20, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆435Jul 11, 2025Updated 9 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,631Apr 9, 2026Updated 3 weeks ago
- Standardized environment infrastructure for Agentic AI development.☆296Apr 10, 2026Updated 3 weeks ago
- General-purpose planning and execution harness for LLMs — structured phases, critique, gating, and review☆64Apr 25, 2026Updated last week
- ☆14Apr 7, 2025Updated last year
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- Mixture of Experts from scratch☆13Apr 12, 2024Updated 2 years ago