X-LANCE / VQTalker
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
★51 · Updated 10 months ago
Alternatives and similar repositories for VQTalker
Users who are interested in VQTalker are comparing it to the repositories listed below.
- Unofficial LivePortrait Training Script [🚧 Under Construction] ★34 · Updated 8 months ago
- [AAAI 2024] Style2Talker - Official PyTorch Implementation ★46 · Updated 2 months ago
- ★20 · Updated last year
- A novel approach for personalized speech-driven 3D facial animation ★53 · Updated last year
- Preprocessing Scripts for Talking Face Generation ★89 · Updated 9 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis ★103 · Updated last month
- ★50 · Updated 3 months ago
- ★14 · Updated 8 months ago
- ★29 · Updated 3 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ★65 · Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation ★62 · Updated 6 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation ★64 · Updated 6 months ago
- ★17 · Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video ★73 · Updated last year
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement ★75 · Updated last year
- ★24 · Updated 10 months ago
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis ★50 · Updated 9 months ago
- ★61 · Updated 3 months ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima… ★20 · Updated last year
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions… ★22 · Updated 10 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation ★62 · Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance ★59 · Updated this week
- This is official inference code of PD-FGC ★94 · Updated 2 years ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads" ★34 · Updated 3 months ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023 ★30 · Updated 2 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023 ★86 · Updated 2 years ago
- NeurIPS 2022 ★39 · Updated 2 years ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation ★59 · Updated 3 months ago
- [ECCV2024 official] KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding ★34 · Updated last year
- Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025] ★53 · Updated 8 months ago