Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
★2,411 · Jan 11, 2026 · Updated 2 months ago
Alternatives and similar repositories for hivemind
Users that are interested in hivemind are comparing it to the libraries listed below.
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021) ★118 · Jan 13, 2022 · Updated 4 years ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ★10,038 · Sep 7, 2024 · Updated last year
- ★398 · Jan 31, 2026 · Updated 2 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient" ★149 · Dec 11, 2023 · Updated 2 years ago
- "Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts" (NeurIPS 2020), original PyTorch implemen… ★56 · Nov 5, 2020 · Updated 5 years ago
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo) ★27 · May 29, 2023 · Updated 2 years ago
- Memory-efficient transformer. Work in progress. ★19 · Sep 17, 2022 · Updated 3 years ago
- PyTorch extensions for high performance and large scale training. ★3,404 · Apr 26, 2025 · Updated 11 months ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation ★30 · Feb 4, 2025 · Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ★7,408 · Feb 3, 2026 · Updated 2 months ago
- prime is a framework for efficient, globally distributed training of AI models over the internet. ★851 · Nov 16, 2025 · Updated 4 months ago
- Accessible large language models via k-bit quantization for PyTorch. ★8,107 · Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ★30,990 · Apr 1, 2026 · Updated last week
- Efficient Deep Learning Systems course materials (HSE, YSDA) ★972 · Mar 14, 2026 · Updated 3 weeks ago
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training ★563 · Jan 13, 2025 · Updated last year
- A Smart, Automatic, Fast and Lightweight Web Scraper for Python ★7,133 · Jun 9, 2025 · Updated 10 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ★9,596 · Apr 2, 2026 · Updated last week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ★9,450 · Feb 20, 2026 · Updated last month
- Running large language models on a single GPU for throughput-oriented scenarios. ★9,376 · Oct 28, 2024 · Updated last year
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ★1,077 · Apr 17, 2024 · Updated last year
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl… ★9,760 · Updated this week
- Development repository for the Triton language and compiler ★18,840 · Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ★14,458 · Mar 30, 2026 · Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ★41,977 · Updated this week
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages ★11 · Feb 9, 2025 · Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ★4,743 · Jan 8, 2024 · Updated 2 years ago
- Accelerated deep learning R&D ★3,375 · Jun 27, 2025 · Updated 9 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ★35,311 · Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ★4,716 · Updated this week
- Library for 8-bit optimizers and quantization routines. ★780 · Aug 18, 2022 · Updated 3 years ago
- functorch is JAX-like composable function transforms for PyTorch. ★1,436 · Aug 21, 2025 · Updated 7 months ago
- Fast and memory-efficient exact attention ★23,185 · Updated this week
- Go ahead and axolotl questions ★11,608 · Updated this week
- A data augmentations library for audio, image, text, and video. ★5,070 · Mar 31, 2026 · Updated last week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ★24,044 · Aug 15, 2024 · Updated last year
- Incentivized Training over Wide Web with 1000x model compression. ★22 · Oct 30, 2024 · Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ★32,202 · Sep 30, 2025 · Updated 6 months ago
- Lightning Training strategy for HiveMind ★18 · Jan 20, 2026 · Updated 2 months ago
- Type annotations and dynamic checking for a tensor's shape, dtype, names, etc. ★1,476 · May 2, 2025 · Updated 11 months ago