SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,331Updated last year
Alternatives and similar repositories for Versatile-Diffusion:
Users that are interested in Versatile-Diffusion are comparing it to the libraries listed below
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion☆1,332Updated 2 years ago
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,117Updated 6 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,559Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,937Updated last year
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,308Updated last year
- ☆3,296Updated 11 months ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,179Updated 2 years ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,029Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆891Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆734Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,415Updated last year
- ☆2,999Updated 2 years ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,278Updated 9 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆769Updated 9 months ago
- Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)☆972Updated last year
- Diffusion attentive attribution maps for interpreting Stable Diffusion.☆752Updated last year
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆951Updated 2 years ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆593Updated 9 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆749Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆700Updated 3 months ago
- ☆1,033Updated 2 years ago
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆531Updated last year
- Official implementation of VQ-Diffusion☆933Updated last year
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆745Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆2,453Updated 4 months ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆888Updated 2 years ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,895Updated 3 months ago
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)☆1,680Updated last year
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆840Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,313Updated last year