SpringHuo / MAVD

MAVD is a Mandarin Audio-Visual dataset with Depth information. It covers several modalities, including audio, RGB images, and depth images.

☆20

Alternatives and similar repositories for MAVD

Users who are interested in MAVD are comparing it to the repositories listed below.

yl4467 / singer
☆14 Updated 8 months ago
sectum1919 / cncvs_data_collector
☆26 Updated 2 years ago
huifu99 / Mimic
A novel approach for personalized speech-driven 3D facial animation
☆53 Updated last year
vskadandale / vocalist
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
☆68 Updated last year
X-LANCE / VQTalker
[AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
☆51 Updated 11 months ago
Moon0316 / T2A
Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023
☆86 Updated 2 years ago
galib360 / FaceXHuBERT
☆102 Updated 3 weeks ago
CVMI-Lab / Speech2Lip
[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
☆73 Updated last year
pegahsalehi / Whisper-AFE-TalkingHeadsGen
☆29 Updated 4 months ago
xjchenGit / MTDVocaLiST
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
☆25 Updated last year
MohammedAlghamdi / talking-heads-acm-mm
Talking Head from Speech Audio using a Pre-trained Image Generator
☆23 Updated last year
ZiqiaoPeng / EmoTalk
This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
☆132 Updated 4 months ago
on1262 / facialanimation
Source code for: Expressive Speech-driven Facial Animation with controllable emotions
☆39 Updated last year
whwjdqls / DEEPTalk
Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]
☆55 Updated 9 months ago
SJTU-Lucy / EmoFace
☆51 Updated 4 months ago
ms-dot-k / Lip-to-Speech-Synthesis-in-the-Wild
PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)
☆70 Updated last year
tanshuai0219 / style2talker
[AAAI 2024] style2talker - Official PyTorch Implementation
☆46 Updated 3 months ago
universome / HDTF
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
☆105 Updated last year
Dorniwang / PD-FGC-inference
This is the official inference code of PD-FGC
☆97 Updated 2 years ago
psyai-net / SelfTalk_release
This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …
☆140 Updated last year
dc3ea9f / vico_challenge_baseline
☆104 Updated 2 years ago
CVI-SZU / DEGSTalk
[ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis
☆51 Updated 3 weeks ago
uniBruce / Mead
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
☆283 Updated last year
liutaocode / LivePortrait-Train
Unofficial LivePortrait Training Script [🚧 Under Construction]
☆34 Updated 9 months ago
DanBigioi / DiffusionVideoEditing
Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"
☆229 Updated 2 years ago
liutaocode / talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
☆89 Updated 9 months ago
uuembodiedsocialai / ProbTalk3D
☆95 Updated 3 months ago
kaist-ami / 3d-talking-head-av-guidance
[INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"
☆18 Updated 4 months ago
alvinliu0 / HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
☆143 Updated 2 years ago
FedeNoce / s2l-s2d
[ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation
☆62 Updated last year