openai / consistencydecoder
Consistency Distilled Diff VAE
☆2,161Updated last year
Alternatives and similar repositories for consistencydecoder:
Users that are interested in consistencydecoder are comparing it to the libraries listed below
- Open-Set Grounded Text-to-Image Generation☆2,090Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆2,407Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,004Updated 4 months ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,824Updated 2 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,020Updated last year
- InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)☆1,276Updated 9 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,763Updated last month
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,880Updated 2 months ago
- T2I-Adapter☆3,610Updated 8 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,329Updated last month
- Speed up Stable Diffusion with this one simple trick!☆1,325Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,553Updated last year
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,776Updated 4 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,729Updated 8 months ago
- CLIP+MLP Aesthetic Score Predictor☆1,009Updated 8 months ago
- ☆3,248Updated 10 months ago
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction☆933Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,159Updated 3 weeks ago
- Latte: Latent Diffusion Transformer for Video Generation.☆1,794Updated 2 weeks ago
- Transfer the ControlNet with any basemodel in diffusers🔥☆822Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,403Updated last year
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆540Updated 11 months ago
- A prompting enhancement library for transformers-type text embedding systems☆563Updated 2 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆744Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆583Updated 7 months ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆690Updated 2 months ago
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,410Updated 3 weeks ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"☆814Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,250Updated 8 months ago
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,924Updated last year