[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
☆744Feb 21, 2026Updated 3 weeks ago
Alternatives and similar repositories for Stand-In
Users that are interested in Stand-In are comparing it to the libraries listed below
Sorting:
- The core component of Stand-In, the preprocessor, is essential—only images processed through it can fully unlock the capabilities of Stan…☆156Aug 21, 2025Updated 6 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆503Aug 20, 2025Updated 7 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,699Oct 17, 2025Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆674Feb 13, 2026Updated last month
- We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven av…☆1,215Jan 20, 2026Updated 2 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)☆291Mar 12, 2026Updated last week
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,622Jan 26, 2026Updated last month
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,498Sep 11, 2025Updated 6 months ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆446Aug 20, 2025Updated 7 months ago
- [NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio…☆624Oct 22, 2025Updated 4 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,963Updated this week
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆842Apr 27, 2025Updated 10 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆338Oct 14, 2025Updated 5 months ago
- Lynx: Towards High-Fidelity Personalized Video Generation☆314Feb 27, 2026Updated 3 weeks ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,211Oct 15, 2025Updated 5 months ago
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,840Dec 18, 2025Updated 3 months ago
- ☆6,193Feb 22, 2026Updated 3 weeks ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,186Jan 25, 2026Updated last month
- Code for CineScale, higher-resolution video generation based on Wan☆185Aug 25, 2025Updated 6 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆239Aug 22, 2025Updated 6 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆92Sep 11, 2025Updated 6 months ago
- A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captionin…☆147Aug 28, 2025Updated 6 months ago
- Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations☆889Feb 23, 2026Updated 3 weeks ago
- SkyReels-A2: Compose anything in video diffusion transformers☆706Jun 3, 2025Updated 9 months ago
- Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944☆342Aug 7, 2025Updated 7 months ago
- [CVPR 2026] High-Quality Text-to-Video Generation with Alpha Channel☆342Mar 10, 2026Updated last week
- ☆1,053May 14, 2025Updated 10 months ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)☆1,722Jul 25, 2025Updated 7 months ago
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.☆278Feb 3, 2026Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,252Mar 6, 2025Updated last year
- [ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing☆556Aug 20, 2025Updated 7 months ago
- ComfyUI nodes for WanAnimate model input preprocessing☆480Dec 22, 2025Updated 2 months ago
- ☆724Nov 7, 2025Updated 4 months ago
- ☆86Nov 16, 2025Updated 4 months ago
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆3,203Jan 8, 2026Updated 2 months ago
- [SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization☆1,728Aug 14, 2025Updated 7 months ago
- Light Image Video Generation Inference Framework☆2,062Mar 13, 2026Updated last week
- ☆1,745Mar 6, 2026Updated 2 weeks ago
- ☆1,804Aug 6, 2025Updated 7 months ago