aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for speech2face
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆22Updated 8 months ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆169Updated last year
- Auto-AVSR: Lip-Reading Sentences Project☆180Updated 7 months ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆154Updated 4 years ago
- ☆91Updated 3 years ago
- ☆129Updated last year
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆78Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆65Updated 8 months ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆68Updated 5 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆158Updated 2 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆183Updated 4 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆190Updated 2 years ago
- A repository for generating stylized talking 3D and 3D face☆278Updated 3 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆111Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆296Updated last month
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆144Updated 3 years ago
- Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.☆110Updated 3 years ago
- ☆96Updated 9 months ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆126Updated last year
- ☆10Updated last week
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆144Updated 10 months ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆340Updated 2 years ago
- a PyTorch implementation of Lip2Wav☆49Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆81Updated last year
- implementation based on "Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion"☆160Updated 4 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 2 years ago
- PPG-Based Voice Conversion☆329Updated 2 years ago
- ☆46Updated 11 months ago
- Generating Talking Face Landmarks from Speech☆156Updated last year