rom1504 / laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
☆220 Updated 11 months ago
Alternatives and similar repositories for laion-prepro:
Users who are interested in laion-prepro are comparing it to the libraries listed below.
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis ☆315 Updated last year
- Description and pointers of laion datasets ☆246 Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support. ☆131 Updated last year
- Efficiently read embedding in streaming from any filesystem ☆99 Updated last year
- ☆103 Updated last year
- Open reproduction of MUSE for fast text2image generation. ☆350 Updated 11 months ago
- Let's make a video clip ☆93 Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆314 Updated 11 months ago
- ☆115 Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆631 Updated 8 months ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures ☆541 Updated 2 years ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆282 Updated last year
- ☆500 Updated 4 months ago
- ☆171 Updated last year
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training. ☆389 Updated 2 years ago
- Finetune glide-text2im from openai on your own data. ☆89 Updated 2 years ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models ☆325 Updated 2 years ago
- Easily compute clip embeddings from video frames ☆145 Updated last year
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors ☆336 Updated 2 years ago
- ☆335 Updated 2 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆534 Updated last year
- Unofficial implementation of Tune-A-Video ☆193 Updated 2 years ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆420 Updated last year
- ☆208 Updated last year
- Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023) ☆162 Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆404 Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024) ☆168 Updated 2 weeks ago
- 1.4B latent diffusion model fine tuning ☆265 Updated 2 years ago
- Benchmarking Generative Models with Artworks ☆227 Updated 2 years ago