alpoktem / MachineDub
Automatic audiovisual translation with lip-syncing
☆10 Updated 4 years ago
Related projects
Alternatives and complementary repositories for MachineDub
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu… ☆15 Updated 9 months ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆32 Updated 8 months ago
- Parallel data voice conversion based on pix2pix ☆21 Updated 5 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆114 Updated 4 months ago
- A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis. ☆76 Updated 2 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆205 Updated 2 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022; Official code ☆106 Updated 2 years ago
- A non-native English corpus for the pronunciation scoring task ☆110 Updated 3 months ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory ☆33 Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021 ☆103 Updated 5 months ago
- Toolbox for easy and qualitative one-shot voice conversion ☆45 Updated 2 years ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ☆46 Updated 3 months ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆128 Updated 3 years ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 2 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech ☆427 Updated 4 months ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice ☆157 Updated 3 years ago
- Audio driven video synthesis ☆40 Updated 2 years ago
- Demo for 2022 Interspeech ☆29 Updated 2 years ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆59 Updated this week
- Lip Synchronization (Wav2Lip). ☆88 Updated last week
- You Said That?: Synthesising Talking Faces from Audio ☆69 Updated 6 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆144 Updated 9 months ago
- Automatically create lip-synced animations ☆72 Updated last month
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆81 Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆47 Updated last year