X-LANCE / VQTalkerLinks
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
β52Updated 11 months ago
Alternatives and similar repositories for VQTalker
Users that are interested in VQTalker are comparing it to the libraries listed below
Sorting:
- Unoffical LivePortrait Training Script [ π§ Under Construction]β35Updated 10 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementationβ46Updated 3 months ago
- β14Updated 9 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesisβ117Updated 2 weeks ago
- β51Updated 4 months ago
- Preprocessing Scipts for Talking Face Generationβ90Updated 10 months ago
- A novel apporach for personalized speech-driven 3D facial animationβ55Updated last year
- β20Updated last year
- β29Updated 5 months ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB imaβ¦β20Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoderβ67Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolationβ67Updated 7 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generationβ62Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Videoβ74Updated last year
- β24Updated 11 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generationβ62Updated 7 months ago
- β17Updated last year
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancementβ75Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023β30Updated 2 years ago
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesisβ52Updated last month
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generationβ61Updated 5 months ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidanceβ60Updated last month
- NeurIPS 2022β39Updated 3 years ago
- This is official inference code of PD-FGCβ97Updated 2 years ago
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embeddingβ34Updated last year
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"β34Updated 4 months ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023β86Updated 2 years ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressionsβ¦β22Updated 11 months ago
- β61Updated 4 months ago
- [AAAI 2024] SAAS - Official PyTorch Implementationβ10Updated last year