dome272 / Paella
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
☆741Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Paella
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,156Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆523Updated 11 months ago
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,075Updated last month
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆869Updated 8 months ago
- ☆693Updated last year
- Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)☆936Updated last year
- Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation…☆636Updated last year
- 1.4B latent diffusion model fine tuning☆261Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆699Updated 9 months ago
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆399Updated 2 years ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆733Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆401Updated last year
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆528Updated 7 months ago
- [ECCV 2022] Compositional Generation using Diffusion Models☆455Updated 3 months ago
- stable diffusion training☆291Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆334Updated 2 years ago
- MinImagen: A minimal implementation of the Imagen text-to-image model☆295Updated last year
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,321Updated last year
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion☆1,297Updated 2 years ago
- A phenaki reproduction using pytorch.☆219Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆514Updated 10 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆752Updated 3 months ago
- ☆455Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆999Updated last year
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆321Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,214Updated 4 months ago
- Deep Learning Examples☆811Updated last month
- ☆1,455Updated 10 months ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆879Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,542Updated 10 months ago