alpoktem / movie2parallelDBLinks
Automatic parallel speech database extractor from dubbed movies
☆26Updated last year
Alternatives and similar repositories for movie2parallelDB
Users that are interested in movie2parallelDB are comparing it to the libraries listed below
Sorting:
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- ☆34Updated 4 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Updated 3 years ago
- ☆53Updated 5 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Updated 3 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Updated 4 years ago
- ☆25Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- ☆42Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 5 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Updated 3 years ago
- ☆37Updated 4 years ago
- ☆25Updated 6 years ago
- ☆80Updated 5 months ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 5 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 4 years ago
- ☆37Updated 4 years ago
- asr2k☆52Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 6 years ago
- ☆52Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆63Updated 4 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 5 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Updated 4 years ago