chaolongy/KDTalker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chaolongy/KDTalker)

chaolongy / KDTalker

[IJCV 2025] Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait

☆308

Alternatives and similar repositories for KDTalker

Users that are interested in KDTalker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hanbo-Cheng / DAWN-pytorch
View on GitHub
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
☆234Nov 12, 2025Updated 8 months ago
antonibigata / keysync
View on GitHub
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
☆395Jan 23, 2026Updated 6 months ago
Fictionarry / InsTaG
View on GitHub
[CVPR'25] InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
☆175Jul 15, 2025Updated last year
harlanhong / ACTalker
View on GitHub
ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…
☆461Aug 20, 2025Updated 11 months ago
antonibigata / keyface_cvpr
View on GitHub
[CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
☆72Apr 8, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Fantasy-AMAP / fantasy-talking
View on GitHub
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
☆1,623Jan 26, 2026Updated 6 months ago
memoavatar / memo
View on GitHub
[TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation
☆1,069Aug 6, 2025Updated 11 months ago
deepbrainai-research / float
View on GitHub
[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
☆487Nov 10, 2025Updated 8 months ago
ZiqiaoPeng / SyncTalk_2D
View on GitHub
A 2D customized lip-sync model for high-fidelity real-time driving.
☆133Jun 26, 2025Updated last year
wyhsirius / LIA-X
View on GitHub
LIA-X: Interpretable Latent Portrait Animator
☆105Sep 17, 2025Updated 10 months ago
SkyworkAI / SkyReels-A1
View on GitHub
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
☆581Jun 5, 2025Updated last year
warmshao / FasterLivePortrait
View on GitHub
Bring portraits to life in Real Time！onnx/tensorrt support！实时肖像驱动！
☆1,161Jun 29, 2025Updated last year
bytedance / X-Portrait
View on GitHub
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
☆541Oct 14, 2025Updated 9 months ago
kkakkkka / HunyuanPortraitLCM
View on GitHub
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
☆283Mar 14, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jdh-algo / JoyVASA
View on GitHub
Diffusion-based Portrait and Animal Animation
☆874Apr 16, 2026Updated 3 months ago
Songluchuan / AdaSR-TalkingHead
View on GitHub
[ICASSP 2024] Adaptive Super Resolution For One-Shot Talking-Head Generation
☆182Mar 26, 2024Updated 2 years ago
fudan-generative-vision / hallo3
View on GitHub
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
☆1,395Mar 13, 2025Updated last year
uuembodiedsocialai / ProbTalk3D
View on GitHub
☆104Nov 26, 2025Updated 8 months ago
Tencent-Hunyuan / HunyuanPortrait
View on GitHub
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
☆345Dec 7, 2025Updated 7 months ago
Omni-Avatar / OmniAvatar
View on GitHub
☆1,851Aug 6, 2025Updated 11 months ago
hanquansanren / DvD
View on GitHub
[SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…
☆33Mar 10, 2026Updated 4 months ago
lihxxx / DisPose
View on GitHub
[ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
☆376Nov 20, 2025Updated 8 months ago
xg-chu / ARTalk
View on GitHub
ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.
☆137Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NetEase-Media / ControlTalk
View on GitHub
Official code for "Controllable Talking Face Generation by Implicit Facial Keypoints Editing"
☆34Oct 31, 2024Updated last year
xg-chu / GAGAvatar
View on GitHub
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
☆588Updated this week
bytedance / X-Dyna
View on GitHub
[CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation
☆268Jan 30, 2025Updated last year
asw91666 / TRG-Release
View on GitHub
Official PyTorch implementation of "6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry," ECCV 2024
☆106Jun 17, 2025Updated last year
antgroup / ditto-talkinghead
View on GitHub
[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
☆846Nov 12, 2025Updated 8 months ago
MeiGen-AI / llia
View on GitHub
LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
☆152Jun 11, 2025Updated last year
pegahsalehi / Whisper-AFE-TalkingHeadsGen
View on GitHub
☆31Jun 30, 2025Updated last year
xyz123xyz456 / hallo4
View on GitHub
☆61Dec 1, 2025Updated 7 months ago
kaist-ami / MultiTalk
View on GitHub
[INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…
☆195Nov 5, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / LatentSync
View on GitHub
Taming Stable Diffusion for Lip Sync!
☆5,931Jun 20, 2025Updated last year
viddle-app / animatediff
View on GitHub
Animatediff implementation. Includes a ControlNet pipeline.
☆19Dec 24, 2023Updated 2 years ago
CVI-SZU / DEGSTalk
View on GitHub
[ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis
☆55Oct 25, 2025Updated 9 months ago
neeek2303 / EMOPortraits
View on GitHub
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
☆397Apr 8, 2025Updated last year
toto222 / DICE-Talk
View on GitHub
DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…
☆305Aug 7, 2025Updated 11 months ago
morphicfilms / frames-to-video
View on GitHub
☆180Nov 8, 2025Updated 8 months ago
zsxkib / ST-MFNet
View on GitHub
[IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
☆13Oct 9, 2023Updated 2 years ago