facebookresearch / spdl
Scalable and Performant Data Loading
☆299 · Updated this week
Alternatives and similar repositories for spdl
Users who are interested in spdl are comparing it to the libraries listed below.
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo) ☆395 · Updated 2 weeks ago
- Load compute kernels from the Hub ☆271 · Updated this week
- For optimization algorithm research and development. ☆534 · Updated last week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆575 · Updated last month
- Transform datasets at scale. Optimize datasets for fast AI model training. ☆536 · Updated this week
- PyTorch Single Controller ☆414 · Updated this week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆265 · Updated last month
- Helpful tools and examples for working with flex-attention ☆970 · Updated this week
- ☆307 · Updated last year
- This repository contains the experimental PyTorch native float8 training UX ☆224 · Updated last year
- Efficient optimizers ☆261 · Updated last month
- ☆519 · Updated last month
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch ☆537 · Updated 3 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆96 · Updated last month
- Dion optimizer algorithm ☆338 · Updated last week
- PyTorch media decoding and encoding ☆694 · Updated last week
- ☆216 · Updated 7 months ago
- TensorDict is a pytorch dedicated tensor container. ☆966 · Updated this week
- A library for unit scaling in PyTorch ☆130 · Updated 2 months ago
- A tool to configure, launch and manage your machine learning experiments. ☆190 · Updated this week
- ☆168 · Updated last year
- Annotated version of the Mamba paper ☆489 · Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆408 · Updated 5 months ago
- ☆330 · Updated this week
- Accelerated First Order Parallel Associative Scan ☆188 · Updated last year
- ☆301 · Updated 4 months ago
- ☆279 · Updated last year
- ☆118 · Updated last year
- Implementation of a Transformer, but completely in Triton ☆274 · Updated 3 years ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆269 · Updated last month