Photoroom / datago
A natively parallel dataloader for Python, written in Rust. It serves data at GB/s speeds and covers aspect-ratio bucketing, cropping, and resizing for image ML workloads.
☆123 · Updated this week
Alternatives and similar repositories for datago
Users who are interested in datago are comparing it to the libraries listed below.
- Framework based on a vector database to store, manage and curate large image datasets ☆80 · Updated 3 months ago
- ☆91 · Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆97 · Updated 4 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 · Updated 9 months ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆132 · Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160 · Updated last year
- WIP ☆93 · Updated last year
- ☆314 · Updated last year
- Focused on fast experimentation and simplicity ☆76 · Updated 11 months ago
- Train vision models using JAX and 🤗 transformers ☆100 · Updated last month
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling" ☆124 · Updated 8 months ago
- ☆34 · Updated last year
- Scalable and Performant Data Loading ☆352 · Updated this week
- Supporting PyTorch FSDP for optimizers ☆84 · Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers ☆116 · Updated 3 months ago
- ☆59 · Updated last year
- Automatically take good care of your preemptible TPUs ☆37 · Updated 2 years ago
- ☆50 · Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆45 · Updated this week
- Efficient optimizers ☆277 · Updated last month
- ☆89 · Updated 5 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆70 · Updated last year
- ☆23 · Updated last year
- Modular, scalable library to train ML models ☆178 · Updated last week
- ☆63 · Updated last year
- ☆82 · Updated last year
- NLP with Rust for Python 🦀🐍 ☆70 · Updated 7 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers ☆73 · Updated 7 months ago
- Run PaliGemma in real time ☆133 · Updated last year
- Research implementation of Native Sparse Attention (arXiv:2502.11089) ☆63 · Updated 9 months ago