IS2AI / SpeakingFaces
A large-scale, publicly available visual-thermal-audio dataset designed to encourage research in user authentication, facial recognition, speech recognition, and human-computer interaction.
☆81Updated 3 years ago
Alternatives and similar repositories for SpeakingFaces:
Users who are interested in SpeakingFaces are comparing it to the repositories listed below:
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- Download and preprocess voxceleb datasets.☆29Updated 11 months ago
- Python implementation of the paper "Dynamic Temporal Alignment of Speech to Lips"☆32Updated 5 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆106Updated 11 months ago
- Facial Expression Feature Extractor☆68Updated 2 years ago
- ☆35Updated 6 years ago
- Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library☆24Updated 3 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆66Updated last year
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆65Updated last year
- This repository contains the network architectures of NeuralVoicePuppetry.☆80Updated 4 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 3 years ago
- ☆21Updated 3 years ago
- Tools for downloading VoxCeleb2 dataset☆29Updated last year
- ☆10Updated 5 months ago
- Talking Face Generation by Conditional Recurrent Adversarial Network☆61Updated 5 years ago
- Official PyTorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20☆65Updated 3 years ago
- ☆42Updated last year
- This repository contains scripts to build Youtube Gesture Dataset.☆123Updated last year
- Information Distillation Generative Adversarial Network in PyTorch☆27Updated 5 years ago
- A pipeline to read lips and generate speech for the read content, i.e., Lip to Speech Synthesis.☆83Updated 3 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆54Updated 3 years ago
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset☆71Updated 5 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Updated 2 years ago
- ☆19Updated 3 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆23Updated last year
- A tool for facial action unit analysis☆38Updated last year
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆63Updated 2 months ago
- Processing and extraction of face and mouth image files from the TCDTIMIT database☆45Updated 4 years ago
- ☆8Updated 2 years ago