FreedomIntelligence / TalkVidLinks
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
☆144Updated 3 weeks ago
Alternatives and similar repositories for TalkVid
Users that are interested in TalkVid are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆63Updated 9 months ago
- ☆132Updated last year
- Preprocessing Scipts for Talking Face Generation☆92Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Updated 10 months ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆190Updated last year
- [CVPR'25] InsTaG: Learning Personalized 3D Talking Head from Few-Second Video☆163Updated 6 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆53Updated last year
- [CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations"☆51Updated 6 months ago
- ☆63Updated 2 months ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆37Updated last year
- Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars☆95Updated last year
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆123Updated 3 months ago
- ☆55Updated 6 months ago
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement☆75Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆51Updated 6 months ago
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis☆54Updated 3 months ago
- [CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation☆191Updated last year
- ☆226Updated last year
- ☆176Updated 2 years ago
- This is official inference code of PD-FGC☆99Updated 2 years ago
- A novel apporach for personalized speech-driven 3D facial animation☆57Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Updated 3 months ago
- [AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation☆64Updated 5 months ago
- ☆100Updated 2 months ago
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆166Updated last year
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆180Updated last year
- Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]☆62Updated 11 months ago
- VFHQ-downloader is a Python-based utility designed for the easy downloading and processing of videos from the VFHQ dataset.☆27Updated last year
- Official Implementation of "MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation" (AAAI 2025)☆175Updated last year
- [WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"☆129Updated last year