tmbdev-archive / webdataset-imagenet-2
A small repository demonstrating the use of WebDataset with ImageNet
☆14Updated 11 months ago
Related projects
Alternatives and complementary repositories for webdataset-imagenet-2
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆27Updated 3 weeks ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆26Updated 8 months ago
- ☆40Updated last year
- ☆33Updated last year
- ☆22Updated last year
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆33Updated 3 months ago
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"☆37Updated last year
- ☆26Updated 3 years ago
- Release of ImageNet-Captions☆45Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆31Updated 6 months ago
- Patching open-vocabulary models by interpolating weights☆90Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- A Video Tokenizer Evaluation Dataset☆48Updated 2 weeks ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆21Updated last year
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆75Updated 7 months ago
- ☆48Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆20Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award)☆53Updated last year
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024.☆20Updated last month
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆30Updated 4 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆33Updated 2 weeks ago
- Minimum implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on cifar10 and mnist☆38Updated 11 months ago
- Official code for "On Calibrating Diffusion Probabilistic Models"☆29Updated last year
- Code for the paper High Fidelity Image Synthesis With Deep VAEs In Latent Space.☆29Updated last year
- fixed official code for paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models".☆40Updated last year
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆105Updated 4 months ago
- This is an official PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆91Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models☆69Updated 10 months ago