liutaocode/DiffDub

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liutaocode/DiffDub)

liutaocode / DiffDub

[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder

☆70

Alternatives and similar repositories for DiffDub

Users that are interested in DiffDub are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liutaocode / talking_face_preprocessing
View on GitHub
Preprocessing Scipts for Talking Face Generation
☆97Jan 21, 2025Updated last year
semchan / HyperLips
View on GitHub
Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".
☆212Mar 9, 2024Updated 2 years ago
rlgnswk / NeRFFaceSpeech_Code
View on GitHub
One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024
☆65Oct 24, 2024Updated last year
liutaocode / LivePortrait-Train
View on GitHub
Unoffical LivePortrait Training Script [ 🚧 Under Construction]
☆40Jan 28, 2025Updated last year
soumik-kanad / diff2lip
View on GitHub
☆379Aug 16, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
cvlab-kaist / MoDiTalker
View on GitHub
Official Implementation of "MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation" (AAAI 2025)
☆175Jan 14, 2025Updated last year
Songluchuan / AdaSR-TalkingHead
View on GitHub
[ICASSP 2024] Adaptive Super Resolution For One-Shot Talking-Head Generation
☆182Mar 26, 2024Updated 2 years ago
amazon-science / iwslt-autodub-task
View on GitHub
☆21Mar 4, 2024Updated 2 years ago
theEricMa / DiffSpeaker
View on GitHub
DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
☆166Mar 31, 2024Updated 2 years ago
ex3ndr / supervoice-enhance
View on GitHub
Supervoice diffusion enhance
☆28Jul 15, 2024Updated 2 years ago
zsxkib / ST-MFNet
View on GitHub
[IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
☆13Oct 9, 2023Updated 2 years ago
Inferencer / SickFace
View on GitHub
Vid Driven Portrait Animation 🤢😷
☆18Jul 7, 2024Updated 2 years ago
tanshuai0219 / EDTalk
View on GitHub
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
☆468Sep 29, 2025Updated 9 months ago
Elsaam2y / DINet_optimized
View on GitHub
An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …
☆109Aug 26, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
vanquish630 / BaldGAN
View on GitHub
Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.
☆12Jun 6, 2022Updated 4 years ago
harisreedhar / Face-Upscalers-ONNX
View on GitHub
ONNX-Powered Inference for State-of-the-Art Face Upscalers
☆112Jul 26, 2024Updated last year
Choddeok / EmoSpherepp
View on GitHub
[TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…
☆129Updated this week
bigai-nlco / IMTalker
View on GitHub
ACM MM | IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
☆186Dec 23, 2025Updated 6 months ago
sowwnn / KFusion-Dual-Domain-for-Speech-to-Landmarks
View on GitHub
KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate an sequence of facial lanmarks f…
☆32Oct 28, 2025Updated 8 months ago
Inferencer / LipSick
View on GitHub
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
☆225Jul 16, 2024Updated 2 years ago
sstzal / DiffTalk
View on GitHub
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
☆472Jul 15, 2024Updated 2 years ago
CVMI-Lab / Speech2Lip
View on GitHub
[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
☆76Mar 28, 2024Updated 2 years ago
andrerochow / fsrt
View on GitHub
Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…
☆125Oct 28, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cvlab-kaist / GaussianTalker
View on GitHub
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Ky…
☆408Oct 12, 2025Updated 9 months ago
shivangi-aneja / FaceTalk
View on GitHub
[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
☆240Mar 17, 2024Updated 2 years ago
g-milis / NEUTART
View on GitHub
PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.
☆34Mar 11, 2025Updated last year
neeek2303 / EMOPortraits
View on GitHub
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
☆397Apr 8, 2025Updated last year
liutaocode / DiarizationVisualization
View on GitHub
Visualization tools for audio-only and multi-modal speaker diarization dataset
☆13Oct 27, 2023Updated 2 years ago
MRzzm / DINet
View on GitHub
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
☆1,127Sep 25, 2023Updated 2 years ago
liutaocode / AwesomeDiarizationDataset
View on GitHub
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16Feb 22, 2023Updated 3 years ago
DanBigioi / DiffusionVideoEditing
View on GitHub
Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"
☆228Jun 30, 2023Updated 3 years ago
SJTU-Lucy / EmoFace
View on GitHub
☆58Jul 9, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
StelaBou / Diffusion-Act
View on GitHub
☆25Sep 5, 2025Updated 10 months ago
xg-chu / GPAvatar
View on GitHub
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
☆346Nov 1, 2024Updated last year
Hanbo-Cheng / DAWN-pytorch
View on GitHub
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
☆234Nov 12, 2025Updated 8 months ago
ykk648 / face_power
View on GitHub
Face_lib separate from AI_Power
☆27Nov 10, 2025Updated 8 months ago
Meta-Portrait / MetaPortrait
View on GitHub
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
☆542May 21, 2023Updated 3 years ago
ZiqiaoPeng / EmoTalk
View on GitHub
This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
☆140Jan 28, 2026Updated 5 months ago
X-LANCE / AniTalker
View on GitHub
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …
☆1,598Aug 15, 2024Updated last year