tmbdev-archive / webdataset-imagenet-2

A small repository demonstrating the use of WebDataset and ImageNet

☆17

Alternatives and similar repositories for webdataset-imagenet-2

Users who are interested in webdataset-imagenet-2 are comparing it to the libraries listed below.

FutureXiang / edm2
Minimal multi-GPU implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"
☆34 Updated last year
mlfoundations / patching
Patching open-vocabulary models by interpolating weights
☆91 Updated last year
ShivamDuggal4 / karl
Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?
☆35 Updated 2 weeks ago
mlfoundations / imagenet-captions
Release of ImageNet-Captions
☆50 Updated 2 years ago
LAION-AI / scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
☆172 Updated last month
baaivision / MUSE-Pytorch
An in-context conditioning version of MUSE with pre-trained checkpoints.
☆113 Updated 2 years ago
NVlabs / TCM
Codebase of Truncated Consistency Models (ICLR 2025)
☆27 Updated 6 months ago
facebookresearch / maws
Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" (https://arxiv.org/abs/2303.13496)
☆91 Updated 3 months ago
hjbahng / cyclereward
CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.
☆37 Updated last month
facebookresearch / long_seq_mae
Code release for the research paper "Exploring Long-Sequence Masked Autoencoders"
☆100 Updated 2 years ago
salesforce / MUST
PyTorch code for MUST
☆107 Updated 3 months ago
tsb0601 / MultiMon
☆25 Updated 2 years ago
patil-suraj / vit-vqgan
JAX implementation of ViT-VQGAN
☆83 Updated 2 years ago
TomerRonen34 / mixed-resolution-vit
☆51 Updated last year
mbaradad / learning_with_noise
Learning to See by Looking at Noise
☆111 Updated 8 months ago
ziplab / Mesa
The official PyTorch implementation of "Mesa: A Memory-saving Training Framework for Transformers".
☆120 Updated 3 years ago
ryanwebster90 / snip-dedup
☆104 Updated last year
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆78 Updated 2 years ago
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?
☆126 Updated 5 months ago
nanlliu / Unsupervised-Compositional-Concepts-Discovery
[ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
☆85 Updated last year
wilson1yan / teco
☆121 Updated 5 months ago
LargeWorldModel / ElasticTok
ElasticTok: Adaptive Tokenization for Image and Video
☆74 Updated 9 months ago
huiwon-jang / CoordTok
☆37 Updated 6 months ago
allenai / grit_official
Official repository for the General Robust Image Task (GRIT) Benchmark
☆54 Updated 2 years ago
mwalmer-umd / vit_analysis
☆36 Updated 2 years ago
ExplainableML / ImageSelect
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27 Updated 2 years ago
yilundu / comet
[NeurIPS 2021] Code for "Unsupervised Learning of Compositional Energy Concepts"
☆62 Updated 2 years ago
tcl9876 / Denoising_Student
☆31 Updated 4 years ago
NVlabs / TokenBench
A Video Tokenizer Evaluation Dataset
☆129 Updated 6 months ago
facebookresearch / ViP-MAE
This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"
☆36 Updated 2 years ago