menyifang / MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
☆1,563 · Updated 6 months ago
Alternatives and similar repositories for MIMO
Users who are interested in MIMO are comparing it to the libraries listed below.
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆2,495 · Updated last month
- Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation! ☆1,033 · Updated 6 months ago
- Diffusion-based Portrait and Animal Animation ☆849 · Updated 3 weeks ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,608 · Updated 4 months ago
- Select a portrait, click to move the head around (please use your own space / GPU!) ☆904 · Updated 4 months ago
- [CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation ☆1,435 · Updated last month
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,867 · Updated 5 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,242 · Updated 9 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ☆2,638 · Updated 9 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,469 · Updated 3 months ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆2,045 · Updated last year
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation ☆1,637 · Updated 3 months ago
- [ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2)… ☆1,559 · Updated 2 weeks ago
- ☆2,566 · Updated last year
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No… ☆1,057 · Updated 2 months ago
- ☆1,044 · Updated 7 months ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,617 · Updated 9 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆1,196 · Updated 2 months ago
- StoryMaker: Towards consistent characters in text-to-image generation ☆717 · Updated last year
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation ☆783 · Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆759 · Updated last year
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆2,021 · Updated last month
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion … ☆1,605 · Updated last year
- Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention" ☆532 · Updated 2 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,758 · Updated 7 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆3,505 · Updated 5 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,681 · Updated last month
- ComfyUI nodes for LivePortrait ☆2,108 · Updated last year
- [ICCV'25] DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion ☆1,321 · Updated 2 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,142 · Updated last year