dvlab-research / ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,255Updated last week
Related projects: ⓘ
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆576Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,155Updated 3 weeks ago
- Fine-Grained Open Domain Image Animation with Motion Guidance☆716Updated last month
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,048Updated 2 months ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆735Updated last month
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"☆453Updated 3 weeks ago
- ICLR 2024 (Spotlight)☆712Updated 6 months ago
- [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos…☆876Updated last month
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,263Updated last month
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,020Updated last month
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,604Updated last month
- ☆434Updated this week
- Latte: Latent Diffusion Transformer for Video Generation.☆1,637Updated last week
- Concept Sliders for Precise Control of Diffusion Models☆922Updated last week
- Stable Video Diffusion Training Code and Extensions.☆560Updated last month
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,683Updated 3 months ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.☆793Updated 3 weeks ago
- Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks☆444Updated 2 weeks ago
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction☆887Updated 8 months ago
- LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆835Updated 3 weeks ago
- Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."☆522Updated 3 weeks ago
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models☆614Updated 2 months ago
- [ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting…☆556Updated last week
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion☆674Updated 2 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,597Updated 2 months ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"☆467Updated 2 months ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆476Updated 8 months ago
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation☆711Updated 3 months ago
- Transparent Image Layer Diffusion using Latent Transparency☆1,983Updated 3 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training☆334Updated 2 months ago