meta-pytorch / OpenEnvLinks
An interface library for RL post training with environments.
☆859Updated this week
Alternatives and similar repositories for OpenEnv
Users that are interested in OpenEnv are comparing it to the libraries listed below
Sorting:
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆485Updated 4 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆780Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆581Updated 4 months ago
- Async RL Training at Scale☆950Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆419Updated last week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆301Updated last week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆504Updated 2 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆933Updated 6 months ago
- Curated collection of community environments☆195Updated last week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,161Updated this week
- ☆941Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments.☆286Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 6 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆305Updated 2 weeks ago
- An open-source tool for LLM prompt optimization.☆734Updated last week
- On the Theoretical Limitations of Embedding-Based Retrieval☆614Updated 3 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆405Updated last month
- dLLM: Simple Diffusion Language Modeling☆1,504Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆451Updated 4 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆229Updated this week
- ☆235Updated 3 weeks ago
- ☆234Updated 6 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆769Updated 3 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,394Updated this week
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆547Updated this week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆745Updated last week
- Scalable toolkit for efficient model reinforcement☆1,141Updated last week
- Build your own visual reasoning model☆415Updated last month
- RLP: Reinforcement as a Pretraining Objective☆213Updated 2 months ago