Scalable and Performant Data Loading
☆369Mar 9, 2026Updated this week
Alternatives and similar repositories for spdl
Users that are interested in spdl are comparing it to the libraries listed below
Sorting:
- ☆21Mar 3, 2025Updated last year
- Hacks for PyTorch☆19Apr 18, 2023Updated 2 years ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆481Updated this week
- Schedule-Free Optimization in PyTorch☆2,262May 21, 2025Updated 9 months ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆3,004Feb 9, 2026Updated last month
- A PyTorch native platform for training generative AI models☆5,111Updated this week
- PyTorch media decoding and encoding☆977Mar 3, 2026Updated last week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆281Nov 24, 2025Updated 3 months ago
- Helpful tools and examples for working with flex-attention☆1,153Feb 8, 2026Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆203Jul 17, 2024Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆373Dec 12, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- ☆40Jul 26, 2024Updated last year
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- TORCH_TRACE parser for PT2☆78Feb 26, 2026Updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,755Jul 18, 2025Updated 7 months ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆132Feb 4, 2026Updated last month
- A toolkit for scaling law research ⚖☆57Jan 27, 2025Updated last year
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Triton-based implementation of Sparse Mixture of Experts.☆268Oct 3, 2025Updated 5 months ago
- TensorDict is a pytorch dedicated tensor container.☆1,011Updated this week
- ☆124May 28, 2024Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- A Python package for PME (Public Market Equivalent) calculation☆13Jan 16, 2026Updated last month
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- BFloat16 Fused Adam Operator for PyTorch☆16Nov 16, 2024Updated last year
- Speed up model training by fixing data loading.☆577Mar 2, 2026Updated last week
- ☆33Nov 4, 2024Updated last year
- PyTorch native quantization and sparsity for training and inference☆2,722Updated this week
- Evaluating LLMs with fewer examples☆169Apr 12, 2024Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆595Aug 12, 2025Updated 6 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆249Jun 6, 2025Updated 9 months ago
- ☆316Jun 21, 2024Updated last year
- Framework to reduce autotune overhead to zero for well known deployments.☆97Sep 19, 2025Updated 5 months ago
- Prototype routines for GPU quantization written using PyTorch.☆21Feb 8, 2026Updated last month
- Efficient Triton Kernels for LLM Training☆6,189Updated this week
- Minimalistic large language model 3D-parallelism training☆2,588Feb 19, 2026Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 2 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,181Feb 11, 2026Updated 3 weeks ago