[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
☆731 · Feb 21, 2026 · Updated last week
Alternatives and similar repositories for Stand-In
Users who are interested in Stand-In are comparing it to the repositories listed below.
- The core component of Stand-In, the preprocessor, is essential: only images processed through it can fully unlock the capabilities of Stan… ☆154 · Aug 21, 2025 · Updated 6 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers ☆500 · Aug 20, 2025 · Updated 6 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆3,648 · Oct 17, 2025 · Updated 4 months ago
- Pusa: Thousands Timesteps Video Diffusion Model ☆671 · Feb 13, 2026 · Updated 2 weeks ago
- We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven av… ☆1,206 · Jan 20, 2026 · Updated last month
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement ☆287 · Jan 13, 2026 · Updated last month
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,481 · Sep 11, 2025 · Updated 5 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,619 · Jan 26, 2026 · Updated last month
- [NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio… ☆620 · Oct 22, 2025 · Updated 4 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆1,912 · Updated this week
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer ☆834 · Apr 27, 2025 · Updated 10 months ago
- [ICCV 2025] ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control… ☆445 · Aug 20, 2025 · Updated 6 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆1,205 · Oct 15, 2025 · Updated 4 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off ☆335 · Oct 14, 2025 · Updated 4 months ago
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation ☆2,813 · Dec 18, 2025 · Updated 2 months ago
- Lynx: Towards High-Fidelity Personalized Video Generation ☆309 · Sep 26, 2025 · Updated 5 months ago
- ☆130 · Dec 24, 2025 · Updated 2 months ago
- ☆6,089 · Feb 16, 2026 · Updated last week
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning ☆1,155 · Jan 25, 2026 · Updated last month
- Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944 ☆336 · Aug 7, 2025 · Updated 6 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆701 · Jun 3, 2025 · Updated 8 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation" ☆232 · Aug 22, 2025 · Updated 6 months ago
- ☆1,049 · May 14, 2025 · Updated 9 months ago
- ☆721 · Nov 7, 2025 · Updated 3 months ago
- ☆1,703 · Feb 19, 2026 · Updated last week
- A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captionin… ☆146 · Aug 28, 2025 · Updated 6 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,250 · Mar 6, 2025 · Updated 11 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-image diffusion models ☆90 · Sep 11, 2025 · Updated 5 months ago
- Code for CineScale, higher-resolution video generation based on Wan ☆183 · Aug 25, 2025 · Updated 6 months ago
- [SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization ☆1,744 · Aug 14, 2025 · Updated 6 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens. ☆279 · Feb 3, 2026 · Updated 3 weeks ago
- ☆1,357 · Apr 21, 2025 · Updated 10 months ago
- [ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing ☆544 · Aug 20, 2025 · Updated 6 months ago
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation" ☆3,193 · Jan 8, 2026 · Updated last month
- [CVPR 2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation ☆283 · Feb 19, 2026 · Updated last week
- ComfyUI nodes for WanAnimate model input preprocessing ☆468 · Dec 22, 2025 · Updated 2 months ago
- ☆1,790 · Aug 6, 2025 · Updated 6 months ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025) ☆1,715 · Jul 25, 2025 · Updated 7 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,746 · Nov 28, 2025 · Updated 3 months ago