aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆9Updated last year
Alternatives and similar repositories for speech2face:
Users who are interested in speech2face are comparing it to the libraries listed below.
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆22Updated 11 months ago
- A pipeline to read lips and generate speech for the read content, i.e., Lip to Speech Synthesis.☆79Updated 3 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆192Updated 3 years ago
- Official Code for Assem-VC @ICASSP2022☆266Updated 2 years ago
- ☆85Updated 3 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆65Updated 11 months ago
- A PyTorch implementation of Lip2Wav☆49Updated 2 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆107Updated 2 years ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆46Updated 2 months ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆344Updated 2 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆60Updated 4 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆113Updated 4 years ago
- ☆10Updated 3 months ago
- ☆50Updated last year
- Official implementation of SpeechSplit2☆132Updated 2 years ago
- ☆94Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 6 months ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆157Updated last year
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆86Updated 3 years ago
- PPG-Based Voice Conversion☆332Updated 2 years ago
- ☆129Updated last year
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆89Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆295Updated 3 years ago
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-…☆123Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆190Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆114Updated last year
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆211Updated 7 months ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago