Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
☆1,280Jun 8, 2026Updated this week
Alternatives and similar repositories for atropos
Users that are interested in atropos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Agentic RL Training at Scale☆1,455Updated this week
- Our library for RL environments + evals☆4,187Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆493Sep 27, 2024Updated last year
- An open infrastructure to democratize and decentralize the development of superintelligence for humanity.☆846Mar 24, 2026Updated 2 months ago
- Distributed Training Over-The-Internet☆1,034Oct 14, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,444Apr 17, 2026Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆111Mar 7, 2025Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,993Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆3,435Nov 13, 2024Updated last year
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,953Jun 6, 2026Updated last week
- Democratizing Reinforcement Learning for LLMs☆5,608Updated this week
- ☆36Dec 15, 2025Updated 5 months ago
- Exploring Applications of GRPO☆253Aug 25, 2025Updated 9 months ago
- MoE training for Me and You and maybe other people☆386Mar 15, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Go ahead and axolotl questions☆12,032Updated this week
- Tools for merging pretrained large language models.☆7,126May 6, 2026Updated last month
- A PyTorch native library for large model training☆30Apr 1, 2026Updated 2 months ago
- DeMo: Decoupled Momentum Optimization☆201Dec 2, 2024Updated last year
- ☆138Mar 20, 2025Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,251Updated this week
- ☆19Mar 16, 2025Updated last year
- Efficient Triton Kernels for LLM Training☆6,430Updated this week
- Asynchronous P2P communication backend for decentralized pipeline parallelism☆43Jun 9, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automatic evals for LLMs☆597Feb 24, 2026Updated 3 months ago
- Puffing up reinforcement learning☆5,976Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆661Jan 29, 2026Updated 4 months ago
- Scalable toolkit for efficient model reinforcement☆1,711Updated this week
- Minimalistic large language model 3D-parallelism training☆2,715May 26, 2026Updated 2 weeks ago
- AllenAI's post-training codebase☆3,746Updated this week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆257Oct 30, 2024Updated last year
- A Gym for Agentic LLMs☆494Jan 21, 2026Updated 4 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆686Jul 29, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Scalable RL solution for advanced reasoning of language models☆1,862Mar 18, 2025Updated last year
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- DSPy: The framework for programming—not prompting—language models☆34,958Updated this week
- System 2 Reasoning Link Collection☆875Mar 16, 2025Updated last year
- ☆128May 19, 2024Updated 2 years ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆419Updated this week