Lightning-AI / litDataLinks

Transform datasets at scale. Optimize datasets for fast AI model training.

☆520

Alternatives and similar repositories for litData

Users that are interested in litData are comparing it to the libraries listed below

Sorting:

facebookresearch / spdl
Scalable and Performant Data Loading
☆291Updated this week
Lightning-AI / lightning-thunder
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…
☆1,385Updated this week
facebookresearch / optimizers
For optimization algorithm research and development.
☆524Updated this week
pytorch / tensordict
TensorDict is a pytorch dedicated tensor container.
☆949Updated last week
Lightning-AI / utilities
Common Python utilities and GitHub Actions in Lightning Ecosystem
☆56Updated this week
pytorch / torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆236Updated 6 months ago
pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆377Updated this week
gradio-app / trackio
A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.
☆570Updated this week
pytorch-labs / attention-gym
Helpful tools and examples for working with flex-attention
☆908Updated 3 weeks ago
NVIDIA-Merlin / dataloader
The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX
☆421Updated last year
pytorch-labs / monarch
PyTorch Single Controller
☆345Updated this week
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆463Updated 5 months ago
apple / ml-sigma-reparam
☆307Updated last year
huggingface / chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
☆158Updated last year
srush / annotated-mamba
Annotated version of the Mamba paper
☆487Updated last year
fferflo / einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
☆392Updated 4 months ago
NVIDIA-NeMo / Run
A tool to configure, launch and manage your machine learning experiments.
☆176Updated this week
google / fiddle
☆350Updated last week
pytorch / data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
☆1,215Updated this week
mit-ll-responsible-ai / hydra-zen
Create powerful Hydra applications without the yaml files and boilerplate code.
☆392Updated this week
BobMcDear / attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆567Updated this week
Slicer / light-the-torch
Install PyTorch distributions with computation backend auto-detection
☆254Updated 3 months ago
BlackHC / toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
☆437Updated 11 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆901Updated 3 months ago
pytorch / torchcodec
PyTorch media decoding and encoding
☆652Updated this week
Chris-hughes10 / pytorch-accelerated
A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …
☆189Updated last month
pytorch-labs / torchfix
TorchFix - a linter for PyTorch-using code with autofix support
☆145Updated 6 months ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆381Updated this week
aimhubio / aimlflow
aim-mlflow integration
☆216Updated 2 years ago
pytorch / ao
PyTorch native quantization and sparsity for training and inference
☆2,227Updated this week