SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,321Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Versatile-Diffusion
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion☆1,298Updated 2 years ago
- ☆3,140Updated 6 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,543Updated 10 months ago
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,075Updated last month
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆870Updated 8 months ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,257Updated 6 months ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆702Updated 9 months ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,156Updated last year
- ☆2,929Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,376Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,000Updated last year
- ☆1,455Updated 10 months ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆741Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,867Updated 11 months ago
- Deep Learning Examples☆811Updated last month
- ☆1,032Updated last year
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆752Updated 3 months ago
- Official implementation of VQ-Diffusion☆900Updated 7 months ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆879Updated last year
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆691Updated 7 months ago
- [ECCV 2022] Compositional Generation using Diffusion Models☆455Updated 3 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,180Updated last month
- CLIP+MLP Aesthetic Score Predictor☆905Updated 4 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,832Updated 4 months ago
- Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)☆936Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆733Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆2,331Updated 4 months ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,214Updated 4 months ago
- Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation…☆636Updated last year
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,923Updated 6 months ago