Distributed Training Over-The-Internet
☆978Oct 14, 2025Updated 4 months ago
Alternatives and similar repositories for DisTrO
Users that are interested in DisTrO are comparing it to the libraries listed below
Sorting:
- DeMo: Decoupled Momentum Optimization☆198Dec 2, 2024Updated last year
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training☆562Jan 13, 2025Updated last year
- An open infrastructure to democratize and decentralize the development of superintelligence for humanity.☆600Updated this week
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆851Nov 16, 2025Updated 3 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆872Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆3,434Nov 13, 2024Updated last year
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆459Sep 27, 2024Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Implementation for MatMul-free LM.☆3,057Dec 2, 2025Updated 3 months ago
- Incentivized Training over Wide Web with 1000x model compression.☆22Oct 30, 2024Updated last year
- NanoGPT (124M) in 2 minutes☆4,679Updated this week
- Efficient Triton Kernels for LLM Training☆6,162Updated this week
- GRadient-INformed MoE☆264Sep 25, 2024Updated last year
- Go ahead and axolotl questions☆11,335Updated this week
- Minimalistic large language model 3D-parallelism training☆2,579Feb 19, 2026Updated last week
- Tools for merging pretrained large language models.☆6,814Jan 26, 2026Updated last month
- ☆137Aug 19, 2024Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,752Jul 18, 2025Updated 7 months ago
- Tile primitives for speedy kernels☆3,183Updated this week
- Optimizing inference proxy for LLMs☆3,342Jan 28, 2026Updated last month
- Schedule-Free Optimization in PyTorch☆2,257May 21, 2025Updated 9 months ago
- ☆1,201Dec 22, 2025Updated 2 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆577Jun 28, 2024Updated last year
- PyTorch native post-training library☆5,691Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,375Feb 21, 2026Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆187Jan 19, 2026Updated last month
- Code for BLT research paper☆2,029Nov 3, 2025Updated 3 months ago
- Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild☆3,189Updated this week
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆248Jun 6, 2025Updated 8 months ago
- Structured Outputs☆13,456Feb 13, 2026Updated 2 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,444Dec 9, 2025Updated 2 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆663Feb 6, 2026Updated 3 weeks ago
- Training LLMs with QLoRA + FSDP☆1,537Nov 9, 2024Updated last year
- [ICLR 2025] Automated Design of Agentic Systems☆1,521Jan 28, 2025Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,184Aug 22, 2025Updated 6 months ago
- A State-Space Model with Rational Transfer Function Representation.☆83May 17, 2024Updated last year
- Minimal reproduction of DeepSeek R1-Zero☆12,853Updated this week
- PyTorch native quantization and sparsity for training and inference☆2,707Updated this week
- Run frontier AI locally.☆41,742Updated this week