rom1504 / laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
☆222 Updated last year
Alternatives and similar repositories for laion-prepro
Users that are interested in laion-prepro are comparing it to the libraries listed below
- Description and pointers of laion datasets ☆245 Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support. ☆133 Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training. ☆404 Updated 3 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis ☆320 Updated last year
- Open reproduction of MUSE for fast text2image generation. ☆355 Updated last year
- ☆103 Updated last year
- Benchmarking Generative Models with Artworks ☆230 Updated 2 years ago
- ☆335 Updated 2 years ago
- Let's make a video clip ☆95 Updated 3 years ago
- Unofficial implementation of Tune-A-Video ☆193 Updated 2 years ago
- Easily compute clip embeddings from video frames ☆146 Updated last year
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors ☆336 Updated 3 years ago
- Finetune glide-text2im from openai on your own data. ☆89 Updated 3 weeks ago
- ☆108 Updated 3 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training. ☆128 Updated 3 years ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0. ☆86 Updated 2 years ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures ☆603 Updated 3 years ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆658 Updated last year
- ☆125 Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆318 Updated last year
- Iterable datapipelines for pytorch training. ☆86 Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆289 Updated 2 years ago
- A phenaki reproduction using pytorch. ☆219 Updated 2 years ago
- code for CLIPDraw ☆144 Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆97 Updated 2 years ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs". ☆482 Updated last year
- Easily create large video dataset from video urls ☆634 Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆427 Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star. ☆163 Updated 2 years ago
- ☆53 Updated 2 years ago