PrimeIntellect-ai / prime-rl
prime-rl is a codebase for decentralized async RL training at scale
☆311 Updated this week
Alternatives and similar repositories for prime-rl
Users who are interested in prime-rl are comparing it to the libraries listed below.
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆333 Updated 5 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆171 Updated 4 months ago
- ☆126 Updated 2 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆367 Updated this week
- Exploring Applications of GRPO ☆229 Updated 2 weeks ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning ☆343 Updated last week
- PyTorch building blocks for the OLMo ecosystem ☆222 Updated this week
- Long context evaluation for large language models ☆211 Updated 3 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆100 Updated 2 months ago
- Scalable toolkit for efficient model reinforcement ☆385 Updated this week
- EvaByte: Efficient Byte-level Language Models at Scale ☆98 Updated last month
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆237 Updated 4 months ago
- Train your own SOTA deductive reasoning model ☆92 Updated 2 months ago
- Tina: Tiny Reasoning Models via LoRA ☆245 Updated this week
- ☆111 Updated 5 months ago
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training ☆504 Updated 4 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆447 Updated this week
- Build your own visual reasoning model ☆370 Updated this week
- ☆188 Updated 3 months ago
- Normalized Transformer (nGPT) ☆181 Updated 6 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆169 Updated this week
- An extension of the nanoGPT repository for training small MOE models. ☆147 Updated 2 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆126 Updated 6 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆105 Updated last month
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆88 Updated last week
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆248 Updated this week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆249 Updated this week
- Manage scalable open LLM inference endpoints in Slurm clusters ☆258 Updated 10 months ago
- smolLM with Entropix sampler on pytorch ☆150 Updated 7 months ago
- DeMo: Decoupled Momentum Optimization ☆188 Updated 6 months ago