aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆10 Updated last year
Related projects
Alternatives and complementary repositories for speech2face
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆340 Updated 2 years ago
- ☆129 Updated last year
- A PyTorch implementation of StarGAN-VC2 ☆146 Updated 4 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation ☆61 Updated 4 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ☆111 Updated 3 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech ☆241 Updated 2 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T… ☆187 Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network ☆319 Updated 3 months ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice ☆157 Updated 3 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL ☆168 Updated last year
- PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation ☆190 Updated 2 years ago
- Learning Lip Sync of Obama from Speech Audio ☆67 Updated 4 years ago
- PPG-Based Voice Conversion ☆328 Updated 2 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆154 Updated 4 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention ☆198 Updated 3 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆282 Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch. ☆182 Updated 4 years ago
- Official Implementation of StyleTTS-VC ☆164 Updated last year
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆158 Updated 2 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019) ☆56 Updated 2 years ago
- Official Code for Assem-VC @ICASSP2022 ☆265 Updated 2 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data… ☆128 Updated 3 years ago
- ☆191 Updated 3 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022; Official code ☆106 Updated 2 years ago
- Generating Talking Face Landmarks from Speech ☆155 Updated last year
- Official implementation of SpeechSplit2 ☆128 Updated 2 years ago
- Audio-driven facial animation generator with BiLSTM used for transcribing the speech and web interface displaying the avatar and the anim… ☆34 Updated 2 years ago
- This is the GitHub page for publicly available emotional speech data. ☆321 Updated 2 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆144 Updated 9 months ago
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-… ☆120 Updated 3 years ago