NousResearch / atroposLinks

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

☆568

Alternatives and similar repositories for atropos

Users that are interested in atropos are comparing it to the libraries listed below

Sorting:

PrimeIntellect-ai / prime-rl
Decentralized RL Training at Scale
☆400Updated this week
open-thought / reasoning-gym
procedural reasoning datasets
☆1,012Updated this week
NousResearch / Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆448Updated 10 months ago
arcprize / arc-agi-benchmarking
Testing baseline LLMs performance across various models
☆291Updated last week
PrimeIntellect-ai / genesys
☆130Updated 4 months ago
pyember / ember
☆209Updated last month
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆267Updated 3 weeks ago
LeonGuertler / UnstableBaselines
☆94Updated last week
willccbb / verifiers
Verifiers for LLM Reinforcement Learning
☆1,690Updated this week
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆245Updated 3 weeks ago
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆108Updated 3 months ago
aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆307Updated last month
groundlight / r1_vlm
Build your own visual reasoning model
☆401Updated this week
xjdr-alt / entropix-local
smol models are fun too
☆93Updated 8 months ago
facebookresearch / MLGym
MLGym A New Framework and Benchmark for Advancing AI Research Agents
☆538Updated last week
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 4 months ago
open-thought / system-2-research
System 2 Reasoning Link Collection
☆848Updated 4 months ago
PsycheFoundation / psyche
An open infrastructure to democratize and decentralize the development of superintelligence for humanity.
☆439Updated this week
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆513Updated last week
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
jerber / lang-jepa
☆118Updated 7 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
PrimeIntellect-ai / prime
prime is a framework for efficient, globally distributed training of AI models over the internet.
☆783Updated 2 months ago
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆809Updated last week
magicproduct / hash-hop
Long context evaluation for large language models
☆220Updated 5 months ago
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆321Updated 8 months ago
aw31 / openai-imo-2025-proofs
☆455Updated 2 weeks ago
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆808Updated 2 weeks ago
NousResearch / DisTrO
Distributed Training Over-The-Internet
☆951Updated 2 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆98Updated 2 months ago