Helsinki-NLP / prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆240 Updated 5 years ago
Alternatives and similar repositories for prosody:
Users who are interested in prosody are comparing it to the repositories listed below.
- ☆185 Updated 9 months ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆168 Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆132 Updated 10 months ago
- Charsiu: A neural phonetic aligner. ☆292 Updated 2 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning ☆185 Updated 5 years ago
- Yet another speech toolkit based on Kaldi and PyTorch ☆173 Updated 4 years ago
- INTERSPEECH 2019 Tutorial Materials ☆193 Updated 3 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat ☆288 Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding" ☆238 Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 Updated 4 years ago
- A Python toolbox for speech features extraction ☆161 Updated 2 years ago
- A pure python module for reading and writing kaldi ark files ☆252 Updated last year
- Segment an audio file and obtain utterance alignments. (Python package) ☆328 Updated 9 months ago
- ☆111 Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆140 Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR ☆220 Updated 4 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset ☆155 Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆154 Updated last year
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆103 Updated 2 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆130 Updated 7 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC ☆248 Updated last year
- ☆273 Updated 4 years ago
- CMU Wilderness Multilingual Speech Dataset ☆275 Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 Updated last year
- DeepSpeech based forced alignment tool ☆237 Updated 4 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020) ☆139 Updated 2 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification ☆84 Updated 5 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆130 Updated last year
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems ☆259 Updated last year