X-LANCE / VQTalkerLinks
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
β53Updated last year
Alternatives and similar repositories for VQTalker
Users that are interested in VQTalker are comparing it to the libraries listed below
Sorting:
- Unoffical LivePortrait Training Script [ π§ Under Construction]β37Updated last year
- β20Updated last year
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesisβ142Updated 3 weeks ago
- [AAAI 2024] stle2talker - Official PyTorch Implementationβ51Updated 5 months ago
- A novel apporach for personalized speech-driven 3D facial animationβ57Updated last year
- Preprocessing Scipts for Talking Face Generationβ92Updated last year
- β14Updated 11 months ago
- β17Updated last year
- β29Updated 7 months ago
- β55Updated 6 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generationβ62Updated 2 years ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Videoβ76Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generationβ62Updated 9 months ago
- This is official inference code of PD-FGCβ99Updated 2 years ago
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesisβ54Updated 3 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolationβ69Updated 9 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"β37Updated 6 months ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB imaβ¦β20Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidanceβ61Updated 3 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoderβ68Updated last year
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancementβ75Updated last year
- β25Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023β30Updated 2 years ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressionsβ¦β22Updated last year
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embeddingβ34Updated last year
- β63Updated 2 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generationβ63Updated 7 months ago
- [AAAI 2024] SAAS - Official PyTorch Implementationβ11Updated last year
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generationβ180Updated last year
- NeurIPS 2022β39Updated 3 years ago