X-LANCE / VQTalkerLinks
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
β53Updated last year
Alternatives and similar repositories for VQTalker
Users that are interested in VQTalker are comparing it to the libraries listed below
Sorting:
- TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesisβ142Updated 3 weeks ago
- Unoffical LivePortrait Training Script [ π§ Under Construction]β37Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementationβ51Updated 5 months ago
- A novel apporach for personalized speech-driven 3D facial animationβ57Updated last year
- Preprocessing Scipts for Talking Face Generationβ92Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB imaβ¦β20Updated last year
- β55Updated 6 months ago
- β20Updated last year
- β29Updated 7 months ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Videoβ76Updated last year
- β14Updated 11 months ago
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancementβ75Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generationβ62Updated 2 years ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generationβ62Updated 9 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"β37Updated 6 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolationβ69Updated 9 months ago
- β17Updated last year
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesisβ54Updated 3 months ago
- NeurIPS 2022β39Updated 3 years ago
- β27Updated 2 years ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidanceβ61Updated 3 months ago
- β25Updated last year
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generationβ63Updated 7 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoderβ68Updated last year
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressionsβ¦β22Updated last year
- This is official inference code of PD-FGCβ99Updated 2 years ago
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embeddingβ34Updated last year
- Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatarsβ94Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023β30Updated 2 years ago
- β63Updated 2 months ago