TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
☆167Jan 11, 2026Updated 3 months ago
Alternatives and similar repositories for TalkVid
Users that are interested in TalkVid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official SpeakerVid-5M data curation code.☆73Jul 23, 2025Updated 9 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆472Nov 10, 2025Updated 5 months ago
- ☆20Sep 11, 2024Updated last year
- ☆16Mar 8, 2024Updated 2 years ago
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆427May 12, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implicit Motion Function - (unofficial) Microsoft recreation☆29Nov 19, 2024Updated last year
- LIA-X: Interpretable Latent Portrait Animator☆101Sep 17, 2025Updated 7 months ago
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆350Mar 11, 2025Updated last year
- ☆102Nov 26, 2025Updated 5 months ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆194Nov 5, 2024Updated last year
- 💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this re…☆1,487Apr 18, 2026Updated last week
- ☆133Jul 8, 2024Updated last year
- KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate an sequence of facial lanmarks f…☆30Oct 28, 2025Updated 6 months ago
- ☆25Dec 19, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆74Mar 13, 2025Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆70Apr 8, 2025Updated last year
- ☆30Jun 30, 2025Updated 9 months ago
- 🎓 Update Talking-Face Research Papers Daily☆428Updated this week
- Repository for the paper "3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow", CVPR 2024☆41Dec 16, 2024Updated last year
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,382Mar 13, 2025Updated last year
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆763Nov 12, 2025Updated 5 months ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 7 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆384Jan 23, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization☆37Nov 30, 2025Updated 4 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆64Apr 23, 2025Updated last year
- Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos☆299Mar 24, 2025Updated last year
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- Official Code Base of the Paper: "Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions"☆53Jan 31, 2025Updated last year
- ☆179Jul 12, 2023Updated 2 years ago
- Out of time: automated lip sync in the wild☆883Apr 17, 2026Updated last week
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆22Feb 28, 2025Updated last year
- ComfyUI Workflow Collection | ComfyUI 工作流合集☆21Dec 6, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Expressive Gaussian Human Avatars from Monocular RGB Video (NeurIPS 2024)☆56May 28, 2025Updated 11 months ago
- 基于MuseTalk的数字人代码。☆35Sep 14, 2024Updated last year
- unofficial Split Mean Flow Implementation from bytedance☆70Aug 12, 2025Updated 8 months ago
- Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Ky…☆400Oct 12, 2025Updated 6 months ago
- ☆201Apr 11, 2024Updated 2 years ago
- FLAME head tracker for single image reconstruction and monocular video tracking. [Note: This tracker operates offline and is not intended…☆144Nov 30, 2025Updated 4 months ago
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆65Jun 6, 2025Updated 10 months ago