google-deepmind/asyncdiloco

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/asyncdiloco)

google-deepmind / asyncdiloco

☆51

Alternatives and similar repositories for asyncdiloco

Users that are interested in asyncdiloco are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PrimeIntellect-ai / diloco_simple
View on GitHub
torch implementation of diloco
☆24Updated this week
matttreed / diloco-sim
View on GitHub
☆23Jan 5, 2025Updated last year
giangdip2410 / HyperRouter
View on GitHub
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Nov 29, 2023Updated 2 years ago
Cranial-XIX / metric-residual-network
View on GitHub
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆20Jan 11, 2023Updated 3 years ago
cloneofsimo / zeroshampoo
View on GitHub
☆33Sep 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
PrimeIntellect-ai / smart-contracts
View on GitHub
Solidity contracts for the decentralized Prime Network protocol
☆26Jul 6, 2025Updated last year
xrsrke / pipegoose
View on GitHub
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Dec 14, 2023Updated 2 years ago
cloneofsimo / efae
View on GitHub
☆24Jun 18, 2024Updated 2 years ago
PrimeIntellect-ai / OpenDiloco
View on GitHub
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
☆582Updated this week
Aleph-Alpha-Research / scaling
View on GitHub
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆66Nov 18, 2025Updated 8 months ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
samsja / pydantic_config
View on GitHub
Manage ML configuration with pydantic
☆16Mar 18, 2026Updated 4 months ago
hxixixh / amo-release
View on GitHub
Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30May 3, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Cranial-XIX / longhorn
View on GitHub
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆57Dec 4, 2024Updated last year
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
graphcore-research / unit-scaling
View on GitHub
A library for unit scaling in PyTorch
☆134Jul 11, 2025Updated last year
lqiang67 / tex-diary
View on GitHub
Research Diary System - LaTeX-based academic diary with PDF/HTML compilation
☆31Sep 29, 2025Updated 9 months ago
UCSC-VLAA / FedConv
View on GitHub
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…
☆25Apr 30, 2024Updated 2 years ago
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated 11 months ago
daemyung / practice-triton
View on GitHub
삼각형의 실전! Triton
☆16Feb 15, 2024Updated 2 years ago
UlisseMini / activation_additions_hf
View on GitHub
Reimplementation of https://github.com/montemac/algebraic_value_editing in pure PyTorch for efficiency on large models
☆11Jun 28, 2023Updated 3 years ago
thecharlieblake / lovely-llama
View on GitHub
An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thu-spmi / PPT2DST
View on GitHub
☆11Oct 8, 2023Updated 2 years ago
twilio-labs / ml-training-api
View on GitHub
A service template for asynchronous machine learning model training
☆18Nov 24, 2024Updated last year
zakuro-ai / sakura
View on GitHub
Sakura is the ML library of the Zakuro framework. It provides asynchronous distributed training for Pytorch.
☆18Updated this week
yandex-research / swarm
View on GitHub
Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"
☆150Dec 11, 2023Updated 2 years ago
MarlonBecker / MSAM
View on GitHub
☆23Jan 23, 2024Updated 2 years ago
thomasj02 / AiFilter
View on GitHub
Local LLM-based social network filter
☆73Jan 31, 2024Updated 2 years ago
haileyschoelkopf / triton-index
View on GitHub
See https://github.com/cuda-mode/triton-index/ instead!
☆11May 8, 2024Updated 2 years ago
kimbochen / md-blogs
View on GitHub
A blog where I write about research papers and blog posts I read.
☆12Nov 20, 2024Updated last year
HealthML / active-segmentation
View on GitHub
ActiveSegmentation: A Simulation Framework for Benchmarking Active Learning Strategies for 3D Medical Image Segmentation
☆20Jul 5, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Doraemonzzz / hgru2-pytorch
View on GitHub
☆24Sep 25, 2024Updated last year
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
apd10 / universal_memory_allocation
View on GitHub
☆15Apr 26, 2022Updated 4 years ago
shikaiqiu / compute-better-spent
View on GitHub
☆63Oct 3, 2024Updated last year
apple / ml-sigma-reparam
View on GitHub
☆315Jun 21, 2024Updated 2 years ago
samblouir / birdie
View on GitHub
☆15Jun 8, 2026Updated last month
Narsil / bloomserver
View on GitHub
☆39Oct 3, 2022Updated 3 years ago