PrimeIntellect-ai / OpenDilocoLinks

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

☆550

Alternatives and similar repositories for OpenDiloco

Users that are interested in OpenDiloco are comparing it to the libraries listed below

Sorting:

PrimeIntellect-ai / prime-diloco
prime is a framework for efficient, globally distributed training of AI models over the internet.
☆850Updated 2 weeks ago
NousResearch / DisTrO
Distributed Training Over-The-Internet
☆966Updated last month
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆867Updated this week
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆360Updated 11 months ago
microsoft / VPTQ
VPTQ, A Flexible and Extreme low-bit quantization algorithm
☆668Updated 7 months ago
apoorvumang / prompt-lookup-decoding
☆581Updated last year
NVIDIA / Star-Attention
Efficient LLM Inference over Long Sequences
☆392Updated 5 months ago
Infini-AI-Lab / Sequoia
scalable and robust tree-based speculative decoding algorithm
☆363Updated 10 months ago
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆851Updated last week
apple / ml-recurrent-drafter
☆219Updated 10 months ago
thinking-machines-lab / batch_invariant_ops
☆917Updated last month
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆202Updated last year
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 9 months ago
NVIDIA / kvpress
LLM KV cache compression made easy
☆701Updated this week
meta-pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆454Updated 3 weeks ago
dropbox / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆894Updated last month
hao-ai-lab / Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
☆407Updated last year
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆760Updated this week
intel / auto-round
Advanced quantization toolkit for LLMs and VLMs. Native support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Bits and seamless integration with …
☆735Updated this week
wbrickner / noise_step
noise_step: Training in 1.58b With No Gradient Memory
☆221Updated 11 months ago
HazyResearch / lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
☆249Updated 10 months ago
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆919Updated 2 months ago
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆549Updated last week
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆850Updated last month
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆329Updated last year
foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆271Updated last week
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,372Updated 4 months ago
Cornell-RelaxML / quip-sharp
☆570Updated last year
LeanModels / DFloat11
DFloat11: Lossless LLM Compression for Efficient GPU Inference
☆569Updated last week
neuralmagic / nm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆267Updated last year