rom1504 / laion-prepro
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆206Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for laion-prepro
- Easily compute clip embeddings from video frames☆136Updated last year
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆310Updated last year
- Efficiently read embedding in streaming from any filesystem☆96Updated 6 months ago
- Description and pointers of laion datasets☆235Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆298Updated 5 months ago
- ☆100Updated 9 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆266Updated last year
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆368Updated last year
- ☆102Updated last year
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆487Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆332Updated 5 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆126Updated last year
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆334Updated 2 years ago
- ☆311Updated last year
- Finetune glide-text2im from openai on your own data.☆88Updated 2 years ago
- Unofficial implementation of Tune-A-Video☆191Updated last year
- Let's make a video clip☆93Updated 2 years ago
- Benchmarking Generative Models with Artworks☆222Updated 2 years ago
- Easily create large video dataset from video urls☆546Updated 3 months ago
- ☆156Updated last year
- ☆328Updated last year
- Code for instruction-tuning Stable Diffusion.☆212Updated 9 months ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆95Updated last year
- Aim for the moon. If you miss, you may hit a star.☆160Updated last year
- Large-scale text-video dataset. 10 million captioned short videos.☆602Updated 3 months ago
- ☆442Updated 9 months ago
- ☆108Updated 2 years ago
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆399Updated 2 years ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆321Updated last year
- ☆169Updated 7 months ago