IS2AI / SpeakingFacesLinks
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
☆85Updated 2 months ago
Alternatives and similar repositories for SpeakingFaces
Users that are interested in SpeakingFaces are comparing it to the libraries listed below
Sorting:
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- Learning Lip Sync of Obama from Speech Audio☆66Updated 5 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆73Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 3 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 6 years ago
- You Said That?: Synthesising Talking Faces from Audio☆70Updated 7 years ago
- This repository contains the code for my master thesis on Emotion-Aware Facial Animation☆147Updated 2 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆110Updated 3 years ago
- ☆21Updated 3 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆67Updated last year
- ☆34Updated 7 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Updated 7 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆92Updated 2 months ago
- ☆20Updated 3 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109Updated 3 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆178Updated 2 years ago
- Facial Expression Feature Extractor☆70Updated 2 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆197Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆229Updated 3 years ago
- [NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks☆190Updated 5 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆163Updated 5 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆53Updated last year
- Demo for 2022 ICASSP☆64Updated 3 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- a PyTorch implementation of Lip2Wav☆51Updated 3 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Updated 5 years ago
- Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP☆354Updated 3 years ago