[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
☆68Jul 21, 2024Updated last year
Alternatives and similar repositories for DiffDub
Users that are interested in DiffDub are comparing it to the libraries listed below
Sorting:
- Preprocessing Scipts for Talking Face Generation☆94Jan 21, 2025Updated last year
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆215Mar 9, 2024Updated last year
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆180Mar 26, 2024Updated last year
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Oct 24, 2024Updated last year
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- ☆378Aug 16, 2024Updated last year
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆166Mar 31, 2024Updated last year
- [ECCV 2024 Oral] EDTalk - Official PyTorch Implementation☆456Sep 29, 2025Updated 5 months ago
- Official Implementation of "MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation" (AAAI 2025)☆175Jan 14, 2025Updated last year
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆119Sep 7, 2025Updated 5 months ago
- 🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮☆225Jul 16, 2024Updated last year
- [CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"☆471Jul 15, 2024Updated last year
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆38Jan 28, 2025Updated last year
- ONNX-Powered Inference for State-of-the-Art Face Upscalers☆108Jul 26, 2024Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆76Mar 28, 2024Updated last year
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Mar 11, 2025Updated 11 months ago
- This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation☆138Jan 28, 2026Updated last month
- Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".☆300May 30, 2025Updated 9 months ago
- ☆21Mar 4, 2024Updated last year
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- ☆56Jul 9, 2025Updated 7 months ago
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆109Aug 26, 2023Updated 2 years ago
- ☆25Sep 5, 2025Updated 5 months ago
- [CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models☆237Mar 17, 2024Updated last year
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆124Oct 28, 2025Updated 4 months ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models☆148Jun 11, 2025Updated 8 months ago
- Face_lib separate from AI_Power☆28Nov 10, 2025Updated 3 months ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Ky…☆390Oct 12, 2025Updated 4 months ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation☆546May 21, 2023Updated 2 years ago
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆14Sep 20, 2024Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated last week
- example apps for inference.sh☆19Feb 21, 2026Updated last week
- ☆10Nov 19, 2023Updated 2 years ago
- [ICLR 2024] Generalizable and Precise Head Avatar from Image(s)☆342Nov 1, 2024Updated last year