aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆13 Updated 2 years ago
Alternatives and similar repositories for speech2face
Users who are interested in speech2face are comparing it to the repositories listed below
- ☆55 Updated 2 years ago
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,… ☆82 Updated 4 years ago
- Learning Lip Sync of Obama from Speech Audio ☆66 Updated 4 years ago
- ☆50 Updated 2 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset ☆72 Updated 5 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices? ☆129 Updated 6 months ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory ☆33 Updated 2 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code ☆110 Updated 3 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders ☆121 Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ☆108 Updated last year
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation ☆60 Updated 5 years ago
- ☆95 Updated 4 years ago
- ☆21 Updated 3 years ago
- InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild (ICPR2022) ☆8 Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆24 Updated last year
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis. ☆85 Updated 3 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆146 Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆302 Updated 3 years ago
- A curated list of awesome voice conversion, projects and communities. ☆236 Updated 5 months ago
- ☆130 Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆350 Updated 3 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech ☆247 Updated 3 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions ☆92 Updated last year
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019) ☆56 Updated 3 years ago
- Tools for downloading VoxCeleb2 dataset ☆29 Updated last year
- ☆30 Updated 4 years ago
- Official Implementation of StyleTTS-VC ☆184 Updated 5 months ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022) ☆65 Updated last year
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization ☆54 Updated 3 months ago
- Speech to Facial Animation using GANs ☆40 Updated 3 years ago