luosiallen / latent-consistency-modelLinks
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
☆4,558Updated last year
Alternatives and similar repositories for latent-consistency-model
Users that are interested in latent-consistency-model are comparing it to the libraries listed below
Sorting:
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,133Updated 8 months ago
- Consistency Distilled Diff VAE☆2,194Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,185Updated 10 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,237Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,941Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,825Updated 7 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,435Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,184Updated last year
- T2I-Adapter☆3,739Updated last year
- Let us democratise high-resolution generation! (CVPR 2024)☆2,027Updated last year
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,880Updated 8 months ago
- Official implementation of DreaMoving☆1,802Updated last year
- Transparent Image Layer Diffusion using Latent Transparency☆2,163Updated last year
- Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR …☆1,678Updated 7 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,961Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,221Updated 7 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,873Updated 5 months ago
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,950Updated last year
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,841Updated 10 months ago
- Unofficial Implementation of Animate Anyone☆2,938Updated last year
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,298Updated last year
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,094Updated last year
- VideoSys: An easy and efficient system for video generation☆2,000Updated 3 weeks ago
- ☆2,461Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆7,813Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,157Updated last year
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)☆2,555Updated last year
- A realtime sketch to image demo using LCM and the gradio library.☆1,791Updated last year
- Nightly release of ControlNet 1.1☆5,087Updated last year
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,598Updated 5 months ago