IS2AI / SpeakingFaces
A large-scale, publicly available visual-thermal-audio dataset designed to encourage research in user authentication, facial recognition, speech recognition, and human-computer interaction.
☆78 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for SpeakingFaces
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders ☆120 · Updated 2 years ago
- You Said That?: Synthesising Talking Faces from Audio ☆69 · Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019) ☆56 · Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021 ☆103 · Updated 5 months ago
- Learning Lip Sync of Obama from Speech Audio ☆67 · Updated 4 years ago
- Facial Expression Feature Extractor ☆67 · Updated 2 years ago
- Download and preprocess VoxCeleb datasets. ☆22 · Updated 5 months ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022) ☆65 · Updated last year
- ☆10 · Updated last week
- Official PyTorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20 ☆63 · Updated 3 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading ☆36 · Updated 2 years ago
- Speech-conditioned face generation using Generative Adversarial Networks ☆87 · Updated last year
- Python implementation of the paper "Dynamic Temporal Alignment of Speech to Lips" ☆30 · Updated 5 years ago
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset ☆68 · Updated 5 years ago
- This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener… ☆16 · Updated last year
- Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library ☆21 · Updated 3 years ago
- Implementation of NWT, audio-to-video generation, in PyTorch ☆87 · Updated 2 years ago
- Tools for downloading VoxCeleb2 dataset ☆26 · Updated 8 months ago
- This GitHub repository contains the network architectures of NeuralVoicePuppetry. ☆78 · Updated 3 years ago
- ☆35 · Updated 6 years ago
- ☆39 · Updated last year
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆154 · Updated 4 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices? ☆126 · Updated last year
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023) ☆65 · Updated 8 months ago
- Talking Face Generation by Conditional Recurrent Adversarial Network ☆61 · Updated 4 years ago
- This repository contains the gesture generation model from the paper "Moving Fast and Slow" (https://www.tandfonline.com/doi/full/10.1080… ☆25 · Updated last year
- Demo for ICASSP 2022 ☆64 · Updated 2 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆40 · Updated 3 years ago
- DeepFaceFlow: In-the-wild Dense 3D Facial Motion Estimation ☆80 · Updated 4 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener… ☆122 · Updated last year