IS2AI / SpeakingFacesLinks
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
☆82Updated 4 years ago
Alternatives and similar repositories for SpeakingFaces
Users that are interested in SpeakingFaces are comparing it to the libraries listed below
Sorting:
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆121Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107Updated last year
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 3 years ago
- Facial Expression Feature Extractor☆70Updated 2 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 7 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 6 years ago
- ☆41Updated last year
- ☆46Updated last year
- This github contains the network architectures of NeuralVoicePuppetry.☆80Updated 4 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener…☆127Updated 2 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Updated 4 years ago
- Implementation for Pre-training strategies and datasets for facial representation learning, ECCV 2022☆71Updated last year
- Download and preprocess voxceleb datasets.☆31Updated last week
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆110Updated 3 years ago
- Function to frontalize non-frontal 2D facial landmarks generated from the DLIB library☆24Updated 3 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆160Updated 5 years ago
- ☆34Updated 6 years ago
- ☆10Updated 7 months ago
- ☆21Updated 3 years ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆65Updated last year
- Code for the Active Speakers in Context Paper (CVPR2020)☆54Updated 4 years ago
- Official pytorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20☆65Updated 3 years ago
- This is the repository containing the solution for FG-2020 ABAW Competition☆119Updated last year
- This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener…☆16Updated 2 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆129Updated 6 months ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆45Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆72Updated 5 years ago
- An avatar simulation for AirSim (https://github.com/Microsoft/AirSim).☆77Updated 2 years ago
- A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019☆13Updated 2 years ago
- Tools for downloading VoxCeleb2 dataset☆29Updated last year