rom1504 / laion-preproLinks
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆223Updated last year
Alternatives and similar repositories for laion-prepro
Users that are interested in laion-prepro are comparing it to the libraries listed below
Sorting:
- Description and pointers of laion datasets☆246Updated 2 years ago
- ☆336Updated 2 years ago
- Let's make a video clip☆96Updated 3 years ago
- ☆104Updated last year
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆338Updated 3 years ago
- Easily compute clip embeddings from video frames☆146Updated last year
- Benchmarking Generative Models with Artworks☆232Updated 2 years ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆320Updated last year
- Efficiently read embedding in streaming from any filesystem☆102Updated last month
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆401Updated 2 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆134Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆356Updated last year
- Unofficial implementation of Tune-A-Video☆194Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆657Updated last year
- Finetune glide-text2im from openai on your own data.☆89Updated last week
- Optimized library for large-scale extraction of frames and audio from video.☆204Updated 2 years ago
- ☆108Updated 2 years ago
- A phenaki reproduction using pytorch.☆220Updated last year
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆128Updated 3 years ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆591Updated 3 years ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆290Updated 2 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆316Updated last year
- ☆121Updated 2 years ago
- code for CLIPDraw☆144Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆98Updated 2 years ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆482Updated last year
- Easily create large video dataset from video urls☆633Updated last year
- 1.4B latent diffusion model fine tuning☆266Updated 3 years ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 2 years ago
- ☆545Updated 8 months ago