X-LANCE / VQTalker
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
☆51 Updated 10 months ago
Alternatives and similar repositories for VQTalker
Users who are interested in VQTalker are comparing it to the repositories listed below.
- Unofficial LivePortrait Training Script [🚧 Under Construction] ☆34 Updated 9 months ago
- [AAAI 2024] Style2Talker - Official PyTorch Implementation ☆46 Updated 3 months ago
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis ☆111 Updated 2 months ago
- ☆20 Updated last year
- A novel approach for personalized speech-driven 3D facial animation ☆53 Updated last year
- ☆14 Updated 8 months ago
- ☆51 Updated 4 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ☆66 Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation ☆61 Updated 6 months ago
- ☆29 Updated 4 months ago
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis ☆51 Updated 2 weeks ago
- Preprocessing Scripts for Talking Face Generation ☆89 Updated 9 months ago
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation ☆61 Updated 4 months ago
- ☆24 Updated 10 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads" ☆34 Updated 3 months ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video ☆73 Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation ☆62 Updated last year
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement ☆75 Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation ☆65 Updated 7 months ago
- ☆17 Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance ☆60 Updated 3 weeks ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions… ☆22 Updated 11 months ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima… ☆20 Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023 ☆30 Updated 2 years ago
- ☆61 Updated 3 months ago
- [ECCV2024 official] KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding ☆34 Updated last year
- This is the official inference code of PD-FGC ☆97 Updated 2 years ago
- NeurIPS 2022 ☆39 Updated 2 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023 ☆86 Updated 2 years ago
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation ☆180 Updated last year