rom1504 / img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20 hours on one machine.
☆3,730 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for img2dataset
- Easily compute CLIP embeddings and build a CLIP retrieval system with them ☆2,415 · Updated 7 months ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ☆4,831 · Updated 3 months ago
- An open source implementation of CLIP. ☆10,367 · Updated last week
- An open-source framework for training large multimodal models. ☆3,751 · Updated 2 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆9,953 · Updated this week
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,321 · Updated last year
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert… ☆1,258 · Updated last week
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆6,374 · Updated 5 months ago
- Grounded Language-Image Pre-training ☆2,231 · Updated 9 months ago
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once" ☆4,397 · Updated 3 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆8,668 · Updated this week
- Taming Transformers for High-Resolution Image Synthesis ☆5,818 · Updated 3 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆9,261 · Updated 3 months ago
- EVA Series: Visual Representation Fantasies from BAAI ☆2,312 · Updated 3 months ago
- Karras et al. (2022) diffusion models for PyTorch ☆2,331 · Updated 4 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆7,964 · Updated this week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆2,344 · Updated 2 months ago
- Consistency Distilled Diff VAE ☆2,137 · Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models ☆11,901 · Updated 8 months ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,331 · Updated last month
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,376 · Updated last year
- Open-Set Grounded Text-to-Image Generation ☆2,018 · Updated 8 months ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of DeepMind, in PyTorch ☆1,217 · Updated 2 years ago
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts". ☆1,138 · Updated 10 months ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L… ☆2,422 · Updated 6 months ago
- Official repo for consistency models. ☆6,170 · Updated 8 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs ☆1,832 · Updated 4 months ago