dvlab-research / ControlNeXtLinks

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

☆1,626

Alternatives and similar repositories for ControlNeXt

Users that are interested in ControlNeXt are comparing it to the libraries listed below

Sorting:

Yuanshi9815 / OminiControl
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,862Updated 5 months ago
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆759Updated last year
TencentARC / MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
☆1,473Updated 10 months ago
aigc-apps / VideoX-Fun
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
☆1,701Updated this week
alimama-creative / FLUX-Controlnet-Inpainting
☆785Updated last year
ali-vilab / In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
☆2,043Updated last year
Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,373Updated last week
huggingface / finetrainers
Scalable and memory-optimized training of diffusion models
☆1,312Updated 6 months ago
pixeli99 / SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
☆723Updated last year
alibaba / animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
☆954Updated last year
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,242Updated 10 months ago
aigc-apps / EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,239Updated 9 months ago
TIGER-AI-Lab / AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
☆640Updated last year
open-mmlab / PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting…
☆1,016Updated 2 weeks ago
PixArt-alpha / PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
☆1,882Updated last year
FoundationVision / Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,528Updated last month
mayuelala / FollowYourClick
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via S…
☆908Updated 3 months ago
fallenshock / FlowEdit
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
☆890Updated last month
Picsart-AI-Research / StreamingT2V
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
☆1,617Updated 8 months ago
liming-ai / ControlNet_Plus_Plus
[ECCV 2024] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
☆535Updated 11 months ago
open-mmlab / PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos…
☆977Updated last year
megvii-research / HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
☆834Updated last year
lehduong / OneDiffusion
Official implementation of OneDiffusion paper (CVPR 2025)
☆660Updated last year
TencentQQGYLab / ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,270Updated last year
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,130Updated 9 months ago
HL-hanlin / Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR …
☆462Updated 10 months ago
catcathh / UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
☆612Updated last year
kongzhecn / OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
☆700Updated last year
VideoVerses / VideoTuna
Let's finetune video generation models!
☆531Updated 3 months ago
FireRedTeam / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆719Updated last year