Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,636Sep 25, 2024Updated last year
Alternatives and similar repositories for ControlNeXt
Users that are interested in ControlNeXt are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆768Dec 5, 2024Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,905Jul 3, 2025Updated 8 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,546Nov 18, 2025Updated 4 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR …☆471Feb 11, 2025Updated last year
- ☆190Aug 15, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,252Mar 6, 2025Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆3,003Sep 8, 2024Updated last year
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆514Jun 17, 2025Updated 9 months ago
- ☆387Jun 6, 2024Updated last year
- Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks☆614Sep 27, 2024Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,495Feb 19, 2025Updated last year
- VideoSys: An easy and efficient system for video generation☆2,020Aug 27, 2025Updated 6 months ago
- [NeurIPS D&B Track 2024] Official implementation of HumanVid☆349Oct 14, 2025Updated 5 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,495Jun 28, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,298Nov 27, 2025Updated 3 months ago
- ☆470Feb 12, 2024Updated 2 years ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,154Jan 10, 2025Updated last year
- ☆2,233Nov 8, 2024Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,171Dec 21, 2024Updated last year
- ☆644May 24, 2024Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,532Nov 4, 2025Updated 4 months ago
- Stable Video Diffusion Training Code and Extensions.☆734Jul 25, 2024Updated last year
- Kolors Team☆4,608Nov 13, 2024Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)☆3,496May 31, 2024Updated last year
- Official implementation of AnimateDiff.☆12,067Jul 31, 2024Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,660Mar 5, 2025Updated last year
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆505Jul 2, 2024Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆130Jul 5, 2024Updated last year
- More relighting!☆8,388Feb 20, 2025Updated last year
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745☆257Apr 19, 2025Updated 11 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆514Dec 11, 2024Updated last year
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion☆777Jul 3, 2024Updated last year
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,189Apr 15, 2025Updated 11 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆569Sep 16, 2024Updated last year
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,963Updated this week
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,524Updated this week
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,575Jun 19, 2025Updated 9 months ago