PrimeIntellect-ai / prime-rlLinks

Decentralized RL Training at Scale

☆400

Alternatives and similar repositories for prime-rl

Users that are interested in prime-rl are comparing it to the libraries listed below

Sorting:

facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆344Updated 7 months ago
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆568Updated last week
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆277Updated last week
PrimeIntellect-ai / genesys
☆130Updated 4 months ago
LeonGuertler / UnstableBaselines
☆94Updated last week
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆245Updated 3 weeks ago
magicproduct / hash-hop
Long context evaluation for large language models
☆220Updated 5 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆225Updated this week
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆512Updated 3 weeks ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆269Updated this week
huggingface / picotron_tutorial
☆206Updated 5 months ago
open-thought / reasoning-gym
procedural reasoning datasets
☆1,012Updated this week
marin-community / marin
☆347Updated this week
microsoft / dion
Dion optimizer algorithm
☆193Updated this week
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆185Updated 8 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆418Updated last week
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆321Updated 8 months ago
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆113Updated 3 weeks ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆134Updated this week
PrimeIntellect-ai / OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
☆521Updated 6 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆274Updated 2 months ago
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆558Updated this week
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 4 months ago
pyember / ember
☆209Updated last month
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆513Updated this week
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆103Updated 4 months ago
arcprize / arc-agi-benchmarking
Testing baseline LLMs performance across various models
☆291Updated last week
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
jerber / lang-jepa
☆118Updated 7 months ago