TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
☆146Jan 11, 2026Updated last month
Alternatives and similar repositories for TalkVid
Users that are interested in TalkVid are comparing it to the libraries listed below
Sorting:
- The official SpeakerVid-5M data curation code.☆68Jul 23, 2025Updated 7 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆460Nov 10, 2025Updated 3 months ago
- ☆13Mar 8, 2024Updated last year
- ☆20Sep 11, 2024Updated last year
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆422May 12, 2024Updated last year
- Implicit Motion Function - (unofficial) Microsoft recreation☆26Nov 19, 2024Updated last year
- ☆25Dec 19, 2024Updated last year
- ☆46Jun 24, 2025Updated 8 months ago
- ☆102Nov 26, 2025Updated 3 months ago
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆343Mar 11, 2025Updated 11 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- ☆29Nov 19, 2025Updated 3 months ago
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆66Jun 6, 2025Updated 8 months ago
- 💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this re…☆1,440Nov 6, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization☆35Nov 30, 2025Updated 3 months ago
- KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate an sequence of facial lanmarks f…☆30Oct 28, 2025Updated 4 months ago
- LIA-X: Interpretable Latent Portrait Animator☆98Sep 17, 2025Updated 5 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆376Jan 23, 2026Updated last month
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆708Nov 12, 2025Updated 3 months ago
- ☆17Jul 23, 2025Updated 7 months ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 5 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 10 months ago
- ☆30Jun 30, 2025Updated 8 months ago
- ☆133Jul 8, 2024Updated last year
- Unofficial implementation of MIMO (MImicking anyone anywhere with complex Motions and Object interactions)☆10Nov 22, 2024Updated last year
- 基于MuseTalk的数字人代码。☆35Sep 14, 2024Updated last year
- ☆18Jan 7, 2026Updated last month
- ☆17Apr 7, 2025Updated 10 months ago
- unofficial Split Mean Flow Implementation from bytedance☆66Aug 12, 2025Updated 6 months ago
- Repository for the paper "3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow", CVPR 2024☆41Dec 16, 2024Updated last year
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,368Mar 13, 2025Updated 11 months ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆190Nov 5, 2024Updated last year
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- A 2D customized lip-sync model for high-fidelity real-time driving.☆125Jun 26, 2025Updated 8 months ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 5 months ago
- ☆16Nov 30, 2021Updated 4 years ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆110Updated this week
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆21Feb 28, 2025Updated last year
- ComfyUI Workflow Collection | ComfyUI 工作流合集☆18Dec 6, 2024Updated last year