IS2AI / SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
☆81Updated 3 years ago
Alternatives and similar repositories for SpeakingFaces:
Users that are interested in SpeakingFaces are comparing it to the libraries listed below
- Facial Expression Feature Extractor☆68Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆105Updated 10 months ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 5 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 3 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆71Updated 5 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago
- Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library☆23Updated 3 years ago
- Official pytorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20☆65Updated 3 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆54Updated 3 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆45Updated 4 years ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- ☆42Updated last year
- Download and preprocess voxceleb datasets.☆28Updated 10 months ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- Implementation for Pre-training strategies and datasets for facial representation learning, ECCV 2022☆70Updated last year
- ☆10Updated 4 months ago
- ☆54Updated last year
- ☆21Updated 3 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆52Updated last year
- This repository contains the code for my master thesis on Emotion-Aware Facial Animation☆147Updated 2 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆196Updated 2 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆83Updated 3 years ago
- ☆50Updated 2 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener…☆125Updated 2 years ago
- Tools for downloading VoxCeleb2 dataset☆29Updated last year
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆113Updated 4 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆228Updated last year