[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
☆68Jul 21, 2024Updated last year
Alternatives and similar repositories for DiffDub
Users that are interested in DiffDub are comparing it to the libraries listed below
Sorting:
- Preprocessing Scipts for Talking Face Generation☆94Jan 21, 2025Updated last year
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆215Mar 9, 2024Updated 2 years ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆38Jan 28, 2025Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Oct 24, 2024Updated last year
- ☆379Aug 16, 2024Updated last year
- Official Implementation of "MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation" (AAAI 2025)☆174Jan 14, 2025Updated last year
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆181Mar 26, 2024Updated last year
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆166Mar 31, 2024Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- ☆21Mar 4, 2024Updated 2 years ago
- [ECCV 2024 Oral] EDTalk - Official PyTorch Implementation☆460Sep 29, 2025Updated 5 months ago
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆109Aug 26, 2023Updated 2 years ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆108Jul 26, 2024Updated last year
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆123Sep 7, 2025Updated 6 months ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Mar 11, 2025Updated last year
- 🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮☆226Jul 16, 2024Updated last year
- [CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"☆471Jul 15, 2024Updated last year
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆124Oct 28, 2025Updated 4 months ago
- Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Ky…☆392Oct 12, 2025Updated 5 months ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆76Mar 28, 2024Updated last year
- [CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models☆238Mar 17, 2024Updated 2 years ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models☆148Jun 11, 2025Updated 9 months ago
- [IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer☆215Mar 21, 2025Updated 11 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆395Apr 8, 2025Updated 11 months ago
- The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."☆1,109Sep 25, 2023Updated 2 years ago
- Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"☆229Jun 30, 2023Updated 2 years ago
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- ☆56Jul 9, 2025Updated 8 months ago
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆15Feb 22, 2023Updated 3 years ago
- ☆25Sep 5, 2025Updated 6 months ago
- [ICLR 2024] Generalizable and Precise Head Avatar from Image(s)☆343Nov 1, 2024Updated last year
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆239Nov 12, 2025Updated 4 months ago
- This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation☆137Jan 28, 2026Updated last month
- KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate an sequence of facial lanmarks f…☆30Oct 28, 2025Updated 4 months ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation☆546May 21, 2023Updated 2 years ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆18Sep 25, 2025Updated 5 months ago