Yubo-Shankui / Bind-Your-Avatar-Implementation
Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Router"
☆32 Updated 4 months ago
Alternatives and similar repositories for Bind-Your-Avatar-Implementation
Users who are interested in Bind-Your-Avatar-Implementation are comparing it to the repositories listed below.
- ☆132 Updated 7 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆48 Updated last year
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆100 Updated 3 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆37 Updated last month
- VideoCoF: Unified Video Editing with Temporal Reasoner ☆129 Updated last month
- An official implementation of SwapAnyone. ☆74 Updated 10 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆114 Updated 8 months ago
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026 ☆36 Updated 2 months ago
- The official UniVerse-1 code. ☆119 Updated 3 months ago
- Implementation Code for Omni-Effects ☆173 Updated last month
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing" ☆51 Updated last week
- Multi-human Interactive Talking Dataset ☆67 Updated 5 months ago
- DiT for VAE (and Video Generation) ☆35 Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 Updated 6 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption" ☆38 Updated 8 months ago
- ☆141 Updated 3 months ago
- ☆32 Updated 10 months ago
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation ☆82 Updated 10 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025) ☆39 Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆65 Updated 8 months ago
- ☆85 Updated 3 months ago
- ☆52 Updated 3 weeks ago
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing" ☆65 Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆72 Updated 6 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation ☆49 Updated 9 months ago
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models" ☆73 Updated last year
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆78 Updated 5 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing" ☆76 Updated last month
- ThinkGen: Generalized Thinking for Visual Generation ☆46 Updated last month
- ☆92 Updated 5 months ago