alpoktem / MachineDub
Automatic audiovisual translation with lip-syncing
☆10Updated 5 years ago
Alternatives and similar repositories for MachineDub:
Users that are interested in MachineDub are comparing it to the libraries listed below
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆133Updated 8 months ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆159Updated 3 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆346Updated 2 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆191Updated 2 years ago
- A curated list of awesome voice conversion, projects and communities.☆223Updated last month
- Python forced alignment☆86Updated 10 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆51Updated last year
- Charsiu: A neural phonetic aligner.☆294Updated 2 years ago
- ☆129Updated last year
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆443Updated 8 months ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- Multilingual G2P in 100 languages☆300Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆35Updated last year
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆240Updated 7 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆127Updated last year
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆81Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆115Updated 3 months ago
- an improved version of Real-time-voice-cloning☆48Updated last year
- ☆21Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆294Updated last year
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆15Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆330Updated 9 months ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆221Updated 2 years ago
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 2 years ago