NousResearch / DisTrO
Distributed Training Over-The-Internet
☆806 Updated 2 weeks ago
Alternatives and similar repositories for DisTrO:
Users who are interested in DisTrO are comparing it to the repositories listed below.
- prime is a framework for efficient, globally distributed training of AI models over the internet. ☆556 Updated this week
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training ☆379 Updated 3 weeks ago
- VPTQ, A Flexible and Extreme low-bit quantization algorithm ☆543 Updated last week
- NanoGPT (124M) in 5 minutes ☆1,773 Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆821 Updated last week
- Official implementation of Half-Quadratic Quantization (HQQ) ☆715 Updated 3 weeks ago
- ☆736 Updated 3 months ago
- Code for BLT research paper ☆358 Updated this week
- nanoGPT style version of Llama 3.1 ☆1,268 Updated 4 months ago
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,236 Updated this week
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆355 Updated 6 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆612 Updated 5 months ago
- NVIDIA Linux open GPU with P2P support ☆939 Updated 6 months ago
- Minimalistic large language model 3D-parallelism training ☆1,331 Updated this week
- Tile primitives for speedy kernels ☆1,745 Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆1,232 Updated last month
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆790 Updated this week
- Felafax is building AI infra for non-NVIDIA GPUs ☆540 Updated this week
- System 2 Reasoning Link Collection ☆702 Updated this week
- PyTorch native quantization and sparsity for training and inference ☆1,667 Updated this week
- GRadient-INformed MoE ☆261 Updated 2 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations ☆771 Updated last month
- ☆479 Updated 3 months ago
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,188 Updated 3 weeks ago
- Everything about the SmolLM & SmolLM2 family of models ☆1,425 Updated 2 weeks ago
- UNet diffusion model in pure CUDA ☆587 Updated 5 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆586 Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models? ☆774 Updated last week
- smol models are fun too ☆84 Updated last month