CasualGANPapers / Make-A-Scene
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
☆334Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Make-A-Scene
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆699Updated 9 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆310Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆523Updated 11 months ago
- ☆328Updated last year
- Finetune glide-text2im from openai on your own data.☆88Updated 2 years ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆321Updated last year
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆180Updated last year
- 1.4B latent diffusion model fine tuning☆261Updated 2 years ago
- ☆350Updated 2 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆277Updated 2 years ago
- Benchmarking Generative Models with Artworks☆222Updated 2 years ago
- A phenaki reproduction using pytorch.☆219Updated last year
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆560Updated 5 months ago
- ☆154Updated last year
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆206Updated 5 months ago
- [ECCV 2022] Compositional Generation using Diffusion Models☆455Updated 3 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆266Updated last year
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆192Updated last year
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆268Updated 6 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆545Updated last year
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆399Updated 2 years ago
- Unofficial implementation of Tune-A-Video☆191Updated last year
- ☆198Updated 2 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆514Updated 10 months ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆487Updated 2 years ago
- [CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2☆356Updated last year
- Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition" (CVPR 2022)☆307Updated 2 years ago
- Pretrained Dalle2 from laion☆500Updated last year
- Official Jax Implementation of MaskGIT☆449Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆368Updated last year