alpoktem / movie2parallelDB
Automatic parallel speech database extractor from dubbed movies
☆26Updated last year
Alternatives and similar repositories for movie2parallelDB
Users that are interested in movie2parallelDB are comparing it to the libraries listed below
- ☆34Updated 4 years ago
- Official Implementation of Mockingjay in Pytorch☆56Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆82Updated 2 years ago
- ☆52Updated 4 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆38Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- ☆25Updated 3 years ago
- ☆80Updated last month
- Implementation of Multi speaker TTS☆51Updated 4 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 4 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"☆47Updated 3 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 5 years ago
- asr2k☆52Updated last year
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 4 years ago
- Official code for Wav2Seq☆96Updated 3 years ago
- ☆52Updated 4 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- ☆38Updated 4 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together☆47Updated 2 months ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆61Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Updated 3 years ago