Lightning-AI / litdata
Transform datasets at scale. Optimize datasets for fast AI model training.
☆417Updated last week
Alternatives and similar repositories for litdata:
Users that are interested in litdata are comparing it to the libraries listed below
- Scalable and Performant Data Loading☆222Updated this week
- Helpful tools and examples for working with flex-attention☆662Updated last week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,290Updated this week
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆227Updated last month
- TensorDict is a pytorch dedicated tensor container.☆889Updated this week
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆357Updated 2 weeks ago
- For optimization algorithm research and development.☆497Updated this week
- PyTorch per step fault tolerance (actively under development)☆253Updated last week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆515Updated last week
- A pytorch quantization backend for optimum☆891Updated last month
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆503Updated 4 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆550Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆872Updated this week
- Pipeline Parallelism for PyTorch☆753Updated 6 months ago
- Annotated version of the Mamba paper☆474Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆133Updated 3 weeks ago
- Common Python utilities and GitHub Actions in Lightning Ecosystem☆53Updated this week
- ☆301Updated 8 months ago
- ☆344Updated 3 weeks ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆434Updated 6 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆156Updated 11 months ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆225Updated this week
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆434Updated this week
- 🤖 A PyTorch library of curated Transformer models and their composable components☆880Updated 10 months ago
- PyTorch native quantization and sparsity for training and inference☆1,869Updated this week
- A repository for research on medium sized language models.☆492Updated last month
- Best practices & guides on how to write distributed pytorch training code☆356Updated last week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆775Updated 2 weeks ago
- ☆191Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆849Updated last week