Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"
☆195May 15, 2026Updated last week
Alternatives and similar repositories for X-Dub
Users that are interested in X-Dub are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Continuous-Time Distribution Matching for Few-Step Diffusion Distillation👏☆132May 11, 2026Updated 2 weeks ago
- [CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding☆81Feb 22, 2026Updated 3 months ago
- SONIC: Spectral Optimization of Noise for Inpainting with Consistency☆23Jan 4, 2026Updated 4 months ago
- PyTorch implementation of Tacotron and Tacotron2☆34Jul 19, 2022Updated 3 years ago
- MLX SAM3 - MLX Port of SAM3 for interactive image segmentation with Text and Geometric Prompts☆34Jan 23, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multi-agent end-to-end application - General-purpose artificial intelligence agent for multimodal agent collaboration☆133May 15, 2026Updated last week
- ☆32Dec 14, 2025Updated 5 months ago
- A set of solutions is provided, leveraging Openpangu - 7B as the base model for fine - tuning and application of large language models (L…☆490Mar 30, 2026Updated last month
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆15Jul 12, 2021Updated 4 years ago
- English conversation corpus for conversational TTS.☆21Mar 13, 2023Updated 3 years ago
- Extract structured knowledge from Cursor & Claude Code conversations into git-trackable Markdown files☆45May 7, 2026Updated 2 weeks ago
- Clouds Coder is a local-first coding agent platform centered on separating the CLI execution plane from the Web user plane, with Web UI, …☆472May 10, 2026Updated 2 weeks ago
- ☆11Mar 4, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Windows 本地 AI 语音输入法 — 离线语音识别,屏幕感知上下文,热词纠错,语音指令操作选区。Local AI voice typing for Windows — offline speech-to-text with screen-aware context.☆67Apr 17, 2026Updated last month
- ☆18Aug 9, 2018Updated 7 years ago
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- ☆31Feb 7, 2026Updated 3 months ago
- 股票、期货实时数据与模拟盘自动化公开演示版☆123Mar 26, 2026Updated last month
- ACM TOG 2025🎉 Offical repository for "B4M: Breaking Low-Rank Adapter for Making Content-Style Customization"☆68Feb 9, 2026Updated 3 months ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Apr 29, 2026Updated 3 weeks ago
- The first hardware product built for OpenClaw. A wearable AI clip ecosystem — hardware, firmware, and plugin.☆94May 17, 2026Updated last week
- [CVPR 2026]UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation☆215Jan 29, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆33Oct 28, 2025Updated 6 months ago
- [ICLR 2026] Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models☆56Mar 3, 2026Updated 2 months ago
- PaperChain is an AI-driven academic writing pipeline that helps generate structured papers from research ideas to fully formatted Word do…☆85Mar 8, 2026Updated 2 months ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆88Dec 20, 2022Updated 3 years ago
- ICASSP2026 HumDial Challenge☆45Dec 13, 2025Updated 5 months ago
- People who suffer from low vision, sight and visual impairment are not able to see words and letters in ordinary newsprint, books and mag…☆10Oct 1, 2020Updated 5 years ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆99Nov 9, 2024Updated last year
- Literature-grounded research idea exploration for CLI agents. 文献驱动的研究选题与方向探索工具。☆140Apr 2, 2026Updated last month
- [ECCV 2024] RGBD GS-ICP SLAM☆14Nov 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Awesome papers for affective computing with llm and mllm☆24Nov 26, 2025Updated 5 months ago
- On-device assistive vision app for iPhone☆103Apr 17, 2026Updated last month
- A clean-room terminal coding agent built from Claude Code architecture research.☆53Apr 5, 2026Updated last month
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆14Nov 4, 2025Updated 6 months ago
- [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆172Nov 18, 2025Updated 6 months ago
- [ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model