PrimeIntellect-ai / prime-dilocoLinks

prime is a framework for efficient, globally distributed training of AI models over the internet.

☆850

Alternatives and similar repositories for prime-diloco

Users that are interested in prime-diloco are comparing it to the libraries listed below

Sorting:

PrimeIntellect-ai / OpenDiloco
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
☆550Updated 10 months ago
NousResearch / DisTrO
Distributed Training Over-The-Internet
☆966Updated last month
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆867Updated this week
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆760Updated this week
microsoft / VPTQ
VPTQ, A Flexible and Extreme low-bit quantization algorithm
☆668Updated 7 months ago
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆851Updated last week
thinking-machines-lab / batch_invariant_ops
☆917Updated last month
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆360Updated 11 months ago
arcee-ai / DistillKit
An Open Source Toolkit For LLM Distillation
☆785Updated 4 months ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,372Updated 4 months ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,351Updated 2 weeks ago
apoorvumang / prompt-lookup-decoding
☆581Updated last year
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,172Updated 10 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,911Updated 3 months ago
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆919Updated 2 months ago
intel / auto-round
Advanced quantization toolkit for LLMs and VLMs. Native support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Bits and seamless integration with …
☆735Updated this week
dropbox / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆894Updated last month
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆932Updated 2 weeks ago
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆850Updated last month
NVIDIA / Star-Attention
Efficient LLM Inference over Long Sequences
☆392Updated 5 months ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆482Updated this week
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,118Updated 6 months ago
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,242Updated 3 weeks ago
vllm-project / llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
☆2,296Updated this week
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆549Updated last week
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆329Updated last year
efeslab / Nanoflow
A throughput-oriented high-performance serving framework for LLMs
☆918Updated last month
mlc-ai / xgrammar
Fast, Flexible and Portable Structured Generation
☆1,396Updated last week
NVIDIA / kvpress
LLM KV cache compression made easy
☆701Updated this week
cartesia-ai / edge
On-device intelligence.
☆391Updated 8 months ago