Photoroom/datago

A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while covering aspect ratio bucketing, crop and resize for image ML workloads.

☆128

Alternatives and similar repositories for datago

Users who are interested in datago are comparing it to the libraries listed below.

Photoroom / PRX
☆86 · May 5, 2026 · Updated 2 months ago
LouisRouss / DiffuLab
DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core compone…
☆43 · Jan 11, 2026 · Updated 6 months ago
xjdr-alt / mla_blog_translation
☆13 · Jun 18, 2024 · Updated 2 years ago
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84 · Dec 8, 2024 · Updated last year
microsoft / SimpleEgo
☆15 · Jul 7, 2025 · Updated last year
MehdiZouitine / Learning_to_repair_infeasible_problems_with_DRL_and_GNN
☆14 · May 6, 2025 · Updated last year
SuReLI / RRLS
Robust Reinforcement Learning Suite
☆37 · Dec 24, 2024 · Updated last year
raphaelsty / neural-tree
Tree-based indexes for neural-search
☆33 · Mar 4, 2024 · Updated 2 years ago
lucaslingle / mu_transformer
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31 · Jun 5, 2025 · Updated last year
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132 · Apr 17, 2024 · Updated 2 years ago
ezyang / torchdbg
PyTorch centric eager mode debugger
☆46 · Dec 16, 2024 · Updated last year
xjdr-alt / muzero_sketch
☆40 · Jul 26, 2024 · Updated 2 years ago
lightonai / fastkmeans-rs
A Rust rewrite of FastKMeans for CPU-based clustering
☆17 · Jun 29, 2026 · Updated last month
deel-ai / oodeel
Simple, compact, and hackable post-hoc deep OOD detection for already trained tensorflow or pytorch image classifiers.
☆61 · May 19, 2026 · Updated 2 months ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58 · Sep 25, 2024 · Updated last year
flying-sky999 / OmniV2V
☆15 · Jun 2, 2025 · Updated last year
cloneofsimo / minDinoV2
☆24 · Oct 15, 2024 · Updated last year
HomebrewML / HeavyBall
Efficient optimizers
☆336 · Updated this week
PD-Mera / anime-vs-reality-classification
This project simply uses torchvision pretrained model to finetune and classify whether an image is anime or reality
☆14 · Aug 11, 2025 · Updated 11 months ago
frinkleko / LIMIT-Sparse-Embedding
Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…
☆16 · Sep 4, 2025 · Updated 10 months ago
Lucas-rbnt / DRIM
[MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data
☆20 · Apr 3, 2025 · Updated last year
dbrainio / wrappa
Server wrapper for ml models
☆11 · Sep 11, 2019 · Updated 6 years ago
riverstone496 / awesome-second-order-optimization
☆32 · May 17, 2026 · Updated 2 months ago
Farseer-Scaling-Law / Farseer
☆21 · Jun 12, 2025 · Updated last year
lernapparat / torchhacks
Hacks for PyTorch
☆19 · Apr 18, 2023 · Updated 3 years ago
DIUx-xView / xView3_first_place
Contains source code for the winning solution of the xView3 challenge https://iuu.xview.us/.
☆20 · Jul 27, 2022 · Updated 4 years ago
lightonai / ducksearch
Efficient BM25 with DuckDB 🦆
☆69 · Dec 20, 2024 · Updated last year
sgugger / torchdynamo-tests
☆20 · Nov 23, 2022 · Updated 3 years ago
peterwilli / Thingy
All things generative! Discord Bot
☆21 · Aug 13, 2023 · Updated 2 years ago
SuReLI / SGQN
☆19 · Jun 30, 2024 · Updated 2 years ago
cloneofsimo / efae
☆24 · Jun 18, 2024 · Updated 2 years ago
MehdiZouitine / Learning-Disentangled-Representations-via-Mutual-Information-Estimation
Pytorch implementation of Learning Disentangled Representations via Mutual Information Estimation (ECCV 2020)
☆84 · Aug 3, 2021 · Updated 4 years ago
ElisaNguyen / bayesian-tda
Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"
☆17 · Jan 12, 2024 · Updated 2 years ago
turbopuffer / turbogrep
semantic, fts, regex grep tool using tpuf
☆19 · May 18, 2026 · Updated 2 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆47 · Aug 29, 2024 · Updated last year
LaurentMazare / tboard-rs
Read and write tensorboard data using Rust
☆23 · Feb 4, 2024 · Updated 2 years ago
beamlab-hsph / journalclub
Reading great papers in the history of artificial intelligence and machine learning
☆10 · Oct 26, 2022 · Updated 3 years ago
Chillee / lit-llama
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10 · Aug 29, 2023 · Updated 2 years ago
adefazio / sampler
Dynamic weighted sampling with replacement
☆14 · Mar 19, 2016 · Updated 10 years ago