meta-pytorch / OpenEnvLinks
An interface library for RL post training with environments.
☆1,112Updated last week
Alternatives and similar repositories for OpenEnv
Users that are interested in OpenEnv are comparing it to the libraries listed below
Sorting:
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆853Updated this week
- Async RL Training at Scale☆1,044Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆583Updated 5 months ago
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆692Updated this week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆625Updated last week
- Post-training with Tinker☆2,805Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆592Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆313Updated last week
- Lightly-reviewed collection of community environments☆210Updated last week
- dLLM: Simple Diffusion Language Modeling☆1,716Updated this week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,244Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,547Updated this week
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆642Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- ☆961Updated 3 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,332Updated 3 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆426Updated last month
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere…☆392Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆515Updated 2 months ago
- An open-source tool for LLM prompt optimization.☆765Updated 2 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆938Updated 8 months ago
- Training API and CLI☆325Updated last week
- PyTorch-native post-training at scale☆613Updated this week
- ☆237Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆833Updated last month
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆460Updated 5 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆825Updated 2 weeks ago
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆542Updated this week
- Scalable toolkit for efficient model reinforcement☆1,293Updated this week