rom1504 / laion-preproLinks
Get hundred of million of image+url from the crawling at home dataset and preprocess them
☆223Updated last year
Alternatives and similar repositories for laion-prepro
Users that are interested in laion-prepro are comparing it to the libraries listed below
Sorting:
- Description and pointers of laion datasets☆248Updated 3 years ago
- ☆103Updated last year
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆410Updated 5 months ago
- Let's make a video clip☆96Updated 3 years ago
- ☆336Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆104Updated 4 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆320Updated 2 years ago
- Easily compute clip embeddings from video frames☆147Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆359Updated last year
- ☆108Updated 3 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆338Updated 3 years ago
- Benchmarking Generative Models with Artworks☆235Updated 3 years ago
- Unofficial implementation of Tune-A-Video☆193Updated 2 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆129Updated 3 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆668Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆320Updated last year
- Finetune glide-text2im from openai on your own data.☆88Updated 2 months ago
- code for CLIPDraw☆145Updated 3 years ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.☆87Updated 3 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆205Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆98Updated 2 years ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆484Updated 2 years ago
- ☆130Updated 2 years ago
- A phenaki reproduction using pytorch.☆220Updated 2 years ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆646Updated 3 years ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆293Updated 2 years ago
- Iterable datapipelines for pytorch training.☆88Updated last year
- Retrieval augmented diffusion from CompVis.☆53Updated 3 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Updated 2 years ago