Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
☆872Feb 28, 2026Updated this week
Alternatives and similar repositories for atropos
Users that are interested in atropos are comparing it to the libraries listed below
Sorting:
- Async RL Training at Scale☆1,107Updated this week
- Our library for RL environments + evals☆3,869Updated this week
- An open infrastructure to democratize and decentralize the development of superintelligence for humanity.☆600Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,352Jan 16, 2026Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆459Sep 27, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆3,434Nov 13, 2024Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,656Updated this week
- Distributed Training Over-The-Internet☆978Oct 14, 2025Updated 4 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Mar 7, 2025Updated 11 months ago
- MoE training for Me and You and maybe other people☆364Feb 7, 2026Updated 3 weeks ago
- Democratizing Reinforcement Learning for LLMs☆5,167Updated this week
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆8,695Updated this week
- Go ahead and axolotl questions☆11,395Updated this week
- Automatic evals for LLMs☆580Feb 24, 2026Updated last week
- Tools for merging pretrained large language models.☆6,826Updated this week
- ☆19Mar 16, 2025Updated 11 months ago
- DeMo: Decoupled Momentum Optimization☆198Dec 2, 2024Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,108Feb 23, 2026Updated last week
- Efficient Triton Kernels for LLM Training☆6,162Feb 27, 2026Updated last week
- Simplifying reinforcement learning for complex game environments☆5,119Updated this week
- ☆415Nov 2, 2023Updated 2 years ago
- ☆137Mar 20, 2025Updated 11 months ago
- Minimalistic large language model 3D-parallelism training☆2,579Feb 19, 2026Updated 2 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆252Oct 30, 2024Updated last year
- AllenAI's post-training codebase☆3,605Updated this week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆373Feb 26, 2026Updated last week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆644Jul 29, 2025Updated 7 months ago
- ☆32Dec 15, 2025Updated 2 months ago
- Scalable toolkit for efficient model reinforcement☆1,372Updated this week
- ☆337Updated this week
- Asynchronous P2P communication backend for decentralized pipeline parallelism☆42Jun 9, 2025Updated 8 months ago
- ☆1,033Dec 17, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- A Gym for Agentic LLMs☆452Jan 21, 2026Updated last month
- Storing long contexts in tiny caches with self-study☆243Dec 5, 2025Updated 3 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆678Mar 16, 2025Updated 11 months ago
- System 2 Reasoning Link Collection☆869Mar 16, 2025Updated 11 months ago