Lightning-AI / litdata
Transform datasets at scale. Optimize datasets for fast AI model training.
☆367 Updated this week
Related projects
Alternatives and complementary repositories for litdata
- Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors a… ☆1,199 Updated this week
- TensorDict is a PyTorch-dedicated tensor container. ☆840 Updated last week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆483 Updated 3 weeks ago
- For optimization algorithm research and development. ☆449 Updated this week
- Helpful tools and examples for working with flex-attention ☆469 Updated 3 weeks ago
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac… ☆216 Updated last week
- ☆292 Updated 4 months ago
- Annotated version of the Mamba paper ☆457 Updated 8 months ago
- Pipeline Parallelism for PyTorch ☆726 Updated 3 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆328 Updated last month
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆516 Updated this week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆193 Updated this week
- A PyTorch quantization backend for optimum ☆824 Updated last week
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch ☆476 Updated 3 weeks ago
- Common Python utilities and GitHub Actions in Lightning Ecosystem ☆51 Updated this week
- Minimalistic large language model 3D-parallelism training ☆1,260 Updated this week
- ☆303 Updated this week
- Named tensors with first-class dimensions for PyTorch ☆322 Updated last year
- Best practices & guides on how to write distributed PyTorch training code ☆286 Updated 2 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆119 Updated 3 months ago
- The merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch or JAX ☆408 Updated 7 months ago
- PyTorch native quantization and sparsity for training and inference ☆1,585 Updated this week
- This repository contains the experimental PyTorch native float8 training UX ☆211 Updated 3 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆332 Updated this week
- Puzzles for learning Triton ☆1,135 Updated this week
- Recipes are a standard, well-supported set of blueprints for machine learning engineers to rapidly train models using the latest research… ☆294 Updated this week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments. ☆742 Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆457 Updated last week
- A repository for research on medium-sized language models. ☆479 Updated this week