NousResearch / DisTrOLinks

Distributed Training Over-The-Internet

☆963

Alternatives and similar repositories for DisTrO

Users that are interested in DisTrO are comparing it to the libraries listed below

Sorting:

PrimeIntellect-ai / prime-diloco
prime is a framework for efficient, globally distributed training of AI models over the internet.
☆849Updated last week
PrimeIntellect-ai / OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
☆547Updated 10 months ago
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆749Updated this week
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆780Updated this week
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆927Updated last week
google-deepmind / recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
☆652Updated 5 months ago
mistralai / megablocks-public
☆863Updated last year
xjdr-alt / entropix-local
smol models are fun too
☆92Updated last year
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆327Updated last year
apoorvumang / prompt-lookup-decoding
☆577Updated last year
cartesia-ai / edge
On-device intelligence.
☆389Updated 7 months ago
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆824Updated 3 months ago
FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆530Updated last year
NousResearch / Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆450Updated last year
mistralai / mistral-common
Official inference library for pre-processing of Mistral models
☆815Updated this week
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,166Updated 9 months ago
microsoft / VPTQ
VPTQ, A Flexible and Extreme low-bit quantization algorithm
☆668Updated 6 months ago
open-thought / system-2-research
System 2 Reasoning Link Collection
☆855Updated 8 months ago
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆232Updated last year
vgel / repeng
A library for making RepE control vectors
☆662Updated 2 months ago
wbrickner / noise_step
noise_step: Training in 1.58b With No Gradient Memory
☆222Updated 10 months ago
dropbox / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆891Updated 3 weeks ago
PsycheFoundation / psyche
An open infrastructure to democratize and decentralize the development of superintelligence for humanity.
☆513Updated this week
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,222Updated last week
aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆315Updated 4 months ago
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆847Updated last month
pranavjad / mlx-gpt2
gpt-2 from scratch in mlx
☆405Updated last year
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 8 months ago
ironjr / grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
☆564Updated last year
facebookresearch / MobileLLM
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,394Updated 7 months ago