NVIDIA-NeMo / Gym
Build RL environments for LLM training
☆141 Updated this week
Alternatives and similar repositories for Gym
Users who are interested in Gym are comparing it to the libraries listed below.
- ☆235 Updated 3 weeks ago
- Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning" ☆325 Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆241 Updated last week
- PyTorch-native post-training at scale ☆566 Updated this week
- ☆301 Updated 4 months ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents ☆525 Updated last week
- Agent computer interface for AI software engineer. ☆115 Updated last week
- A benchmark for LLMs on complicated tasks in the terminal ☆1,196 Updated 2 weeks ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ☆133 Updated 2 months ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference exampl… ☆173 Updated this week
- Post-training with Tinker ☆2,578 Updated this week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers. ☆474 Updated 3 weeks ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆481 Updated 3 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T… ☆312 Updated 3 months ago
- Train your own SOTA deductive reasoning model ☆107 Updated 9 months ago
- ☆126 Updated 2 months ago
- OpenCUA: Open Foundations for Computer-Use Agents ☆602 Updated last week
- A clean, modular SDK for building AI agents with OpenHands V1. ☆337 Updated this week
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆242 Updated last month
- A framework making it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's R1 ☆22 Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆354 Updated 5 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆769 Updated 2 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆450 Updated 3 months ago
- The LLM abstraction layer for modern AI agent applications. ☆496 Updated this week
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆529 Updated 3 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆485 Updated last week
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making. ☆508 Updated 2 weeks ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆502 Updated last week
- ☆616 Updated this week
- ☆92 Updated last month