alpoktem / movie2parallelDBLinks
Automatic parallel speech database extractor from dubbed movies
☆26Updated last year
Alternatives and similar repositories for movie2parallelDB
Users that are interested in movie2parallelDB are comparing it to the libraries listed below
Sorting:
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- ☆34Updated 4 years ago
- ☆37Updated 4 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- ☆80Updated 4 months ago
- ☆52Updated 5 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆38Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 5 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 3 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 3 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆27Updated 3 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆48Updated 4 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Updated 5 months ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆246Updated 6 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- Official code for Wav2Seq☆97Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 4 years ago
- ☆25Updated 6 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Updated 4 years ago
- ☆42Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Updated 2 years ago