Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"
☆121Mar 19, 2026Updated this week
Alternatives and similar repositories for X-Dub
Users that are interested in X-Dub are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding☆76Feb 22, 2026Updated last month
- PyTorch implementation of Tacotron and Tacotron2☆34Jul 19, 2022Updated 3 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆15Jul 12, 2021Updated 4 years ago
- English conversation corpus for conversational TTS.☆21Mar 13, 2023Updated 3 years ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆16Feb 23, 2026Updated 3 weeks ago
- ☆26Feb 7, 2026Updated last month
- ☆11Mar 4, 2025Updated last year
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- [CVPR 2026]UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation☆212Jan 29, 2026Updated last month
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆97Nov 9, 2024Updated last year
- [ECCV 2024] RGBD GS-ICP SLAM☆14Nov 5, 2024Updated last year
- official code repository for papar: "Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthes…☆20Jul 29, 2025Updated 7 months ago
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆14Nov 4, 2025Updated 4 months ago
- Code and project page for ICCV 2021 paper "DisUnknown: Distilling Unknown Factors for Disentanglement Learning"☆26Oct 13, 2021Updated 4 years ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆17Dec 31, 2024Updated last year
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Sep 13, 2024Updated last year
- ☆11May 9, 2023Updated 2 years ago
- Your AI coworker for any folder: local-first, secure by design, cross-platform, and built for supervised automation.☆85Updated this week
- The pretrained VGG19 mode and scripts for perceptual loss☆22Dec 21, 2020Updated 5 years ago
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- CVPR 24 paper: Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs☆14Mar 19, 2024Updated 2 years ago
- https://wavelandspeech.github.io/☆10Jan 12, 2024Updated 2 years ago
- ☆21Feb 27, 2024Updated 2 years ago
- Gated CNN☆10Jul 17, 2019Updated 6 years ago
- 多变量时序预测transformer☆17Sep 13, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Official Pytorch Implementation of Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model [ECCV'24]☆22Dec 24, 2024Updated last year
- ☆64Sep 18, 2022Updated 3 years ago
- Dataset simulation for DPCCN.☆16Dec 25, 2022Updated 3 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 9 months ago
- Official repository for the paper "EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstructio…☆32Jun 15, 2025Updated 9 months ago
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆36Jan 16, 2026Updated 2 months ago
- ☆40Dec 19, 2025Updated 3 months ago
- [NeurIPS 2025] Official implementation for the paper "SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning"☆49Sep 19, 2025Updated 6 months ago
- Core codes for "Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration"☆15Jul 3, 2024Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago