openai / consistencydecoderLinks
Consistency Distilled Diff VAE
☆2,189Updated last year
Alternatives and similar repositories for consistencydecoder
Users that are interested in consistencydecoder are comparing it to the libraries listed below
Sorting:
- InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)☆1,326Updated last year
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,856Updated 5 months ago
- Open-Set Grounded Text-to-Image Generation☆2,126Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,108Updated 7 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,036Updated last year
- Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR …☆1,661Updated 4 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,424Updated 4 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆935Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆703Updated 5 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆548Updated last year
- [CVPR2024, Highlight] Official code for DragDiffusion☆1,220Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,423Updated 2 years ago
- Segmind Distilled diffusion☆603Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆599Updated 10 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,809Updated 4 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,837Updated 2 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,561Updated last year
- Speed up Stable Diffusion with this one simple trick!☆1,356Updated last year
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"☆830Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction☆939Updated 7 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,208Updated 11 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,902Updated 5 months ago
- T2I-Adapter☆3,695Updated 11 months ago
- VideoSys: An easy and efficient system for video generation☆1,975Updated 3 months ago
- Unified Controllable Visual Generation Model☆644Updated 4 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,015Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆931Updated 7 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆752Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,436Updated 4 months ago
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,285Updated last year