X-LANCE / VQTalkerLinks
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
β50Updated 8 months ago
Alternatives and similar repositories for VQTalker
Users that are interested in VQTalker are comparing it to the libraries listed below
Sorting:
- Unoffical LivePortrait Training Script [ π§ Under Construction]β34Updated 7 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementationβ46Updated last month
- β14Updated 6 months ago
- A novel apporach for personalized speech-driven 3D facial animationβ53Updated last year
- Preprocessing Scipts for Talking Face Generationβ90Updated 7 months ago
- β28Updated 2 months ago
- β20Updated last year
- Latent Diffusion Transformer for Talking Video Synthesisβ60Updated 9 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generationβ61Updated 4 months ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Videoβ73Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoderβ65Updated last year
- β48Updated 2 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generationβ62Updated last year
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesisβ80Updated last week
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB imaβ¦β20Updated last year
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesisβ50Updated 7 months ago
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancementβ74Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023β30Updated 2 years ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolationβ58Updated 5 months ago
- β16Updated last year
- β24Updated 8 months ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressionsβ¦β22Updated 9 months ago
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embeddingβ33Updated last year
- β60Updated last month
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generationβ59Updated 2 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"β34Updated last month
- β24Updated 2 years ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).β25Updated last year
- NeurIPS 2022β38Updated 2 years ago
- This is official inference code of PD-FGCβ94Updated last year