Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
☆1,148Apr 27, 2026Updated last week
Alternatives and similar repositories for atropos
Users that are interested in atropos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Agentic RL Training at Scale☆1,338Updated this week
- Our library for RL environments + evals☆4,057Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆483Sep 27, 2024Updated last year
- An open infrastructure to democratize and decentralize the development of superintelligence for humanity.☆740Mar 24, 2026Updated last month
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,406Apr 17, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Distributed Training Over-The-Internet☆1,018Oct 14, 2025Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆111Mar 7, 2025Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,790Apr 28, 2026Updated last week
- Entropy Based Sampling and Parallel CoT Decoding☆3,431Nov 13, 2024Updated last year
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,382Updated this week
- Democratizing Reinforcement Learning for LLMs☆5,462Updated this week
- ☆36Dec 15, 2025Updated 4 months ago
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 8 months ago
- MoE training for Me and You and maybe other people☆383Mar 15, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Go ahead and axolotl questions☆11,779Apr 27, 2026Updated last week
- Tools for merging pretrained large language models.☆7,052Mar 15, 2026Updated last month
- A PyTorch native library for large model training☆28Apr 1, 2026Updated last month
- DeMo: Decoupled Momentum Optimization☆201Dec 2, 2024Updated last year
- ☆138Mar 20, 2025Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,199Apr 27, 2026Updated last week
- ☆19Mar 16, 2025Updated last year
- Efficient Triton Kernels for LLM Training☆6,315Apr 27, 2026Updated last week
- Asynchronous P2P communication backend for decentralized pipeline parallelism☆43Jun 9, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Puffing up reinforcement learning☆5,639Updated this week
- Automatic evals for LLMs☆591Feb 24, 2026Updated 2 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆652Jan 29, 2026Updated 3 months ago
- Minimalistic large language model 3D-parallelism training☆2,674Apr 7, 2026Updated 3 weeks ago
- Scalable toolkit for efficient model reinforcement☆1,602Updated this week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆259Oct 30, 2024Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆672Jul 29, 2025Updated 9 months ago
- AllenAI's post-training codebase☆3,708Updated this week
- A Gym for Agentic LLMs☆478Jan 21, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Train your own SOTA deductive reasoning model☆110Mar 6, 2025Updated last year
- Scalable RL solution for advanced reasoning of language models☆1,852Mar 18, 2025Updated last year
- ☆128May 19, 2024Updated last year
- ☆345Mar 5, 2026Updated 2 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆404Apr 28, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆34,180Updated this week
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆855Nov 16, 2025Updated 5 months ago