rom1504 / laion-preproLinks
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆222Updated last year
Alternatives and similar repositories for laion-prepro
Users that are interested in laion-prepro are comparing it to the libraries listed below
Sorting:
- Description and pointers of laion datasets☆244Updated 3 years ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆320Updated 2 years ago
- ☆103Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆405Updated 4 months ago
- ☆335Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆355Updated last year
- Efficiently read embedding in streaming from any filesystem☆102Updated 3 months ago
- Let's make a video clip☆95Updated 3 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆337Updated 3 years ago
- ☆108Updated 3 years ago
- Unofficial implementation of Tune-A-Video☆193Updated 2 years ago
- Easily compute clip embeddings from video frames☆147Updated 2 years ago
- Benchmarking Generative Models with Artworks☆233Updated 3 years ago
- A phenaki reproduction using pytorch.☆219Updated 2 years ago
- Finetune glide-text2im from openai on your own data.☆89Updated last month
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆86Updated 2 years ago
- ☆126Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆319Updated last year
- code for CLIPDraw☆144Updated 3 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆128Updated 3 years ago
- Iterable datapipelines for pytorch training.☆87Updated last year
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆619Updated 3 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆663Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆290Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆97Updated 2 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆100Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆430Updated 2 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆205Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆163Updated 2 years ago