A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while covering aspect ratio bucketing, crop and resize for image ML workloads.
☆127Feb 26, 2026Updated this week
Alternatives and similar repositories for datago
Users that are interested in datago are comparing it to the libraries listed below
Sorting:
- ☆40Jul 26, 2024Updated last year
- Server wrapper for ml models☆11Sep 11, 2019Updated 6 years ago
- ☆13Jun 18, 2024Updated last year
- Tree-based indexes for neural-search☆31Mar 4, 2024Updated 2 years ago
- Efficient optimizers☆285Dec 20, 2025Updated 2 months ago
- In-N-Out: Towards Good Initialization for Inpainting and Outpainting (BMVC 2021)☆12Dec 15, 2021Updated 4 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆38Feb 18, 2026Updated 2 weeks ago
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- ☆13Jul 7, 2025Updated 7 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 6 months ago
- Dynamic weighted sampling with replacement☆14Mar 19, 2016Updated 9 years ago
- DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core compone…☆43Jan 11, 2026Updated last month
- ☆10Oct 22, 2024Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆21Feb 26, 2024Updated 2 years ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 7 months ago
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆52Feb 25, 2026Updated last week
- All things generative! Discord Bot☆21Aug 13, 2023Updated 2 years ago
- Cortex-compatible model server for Python and TensorFlow☆18Nov 27, 2022Updated 3 years ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- ☆48Feb 23, 2025Updated last year
- Read and write tensorboard data using Rust☆24Feb 4, 2024Updated 2 years ago
- ☆21Apr 17, 2023Updated 2 years ago
- ☆23Jun 18, 2024Updated last year
- Browser viewer for GaussianAvatars based on Brush☆25Dec 23, 2024Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆31Apr 11, 2024Updated last year
- ☆21Mar 3, 2025Updated last year
- 🤝 Trade any tensors over the network☆31Sep 27, 2023Updated 2 years ago
- A tiny deep learning library written in Java☆27Feb 12, 2023Updated 3 years ago
- Simple, compact, and hackable post-hoc deep OOD detection for already trained tensorflow or pytorch image classifiers.☆60Feb 17, 2026Updated 2 weeks ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- ☆27Mar 13, 2021Updated 4 years ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆130Feb 4, 2026Updated last month
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Oct 8, 2025Updated 4 months ago
- A library for unit scaling in PyTorch☆133Jul 11, 2025Updated 7 months ago
- ☆34Sep 10, 2024Updated last year
- ☆541Oct 10, 2025Updated 4 months ago
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆116Nov 3, 2025Updated 4 months ago
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering☆38May 30, 2023Updated 2 years ago