Helsinki-NLP / prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆235Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for prosody
- ☆184Updated 6 months ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆166Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆130Updated 7 months ago
- INTERSPEECH 2019 Tutorial Materials☆193Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆184Updated 4 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.☆144Updated 2 years ago
- Charsiu: A neural phonetic aligner.☆278Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆138Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆144Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆237Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- A pure python module for reading and writing kaldi ark files☆249Updated last year
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆189Updated 3 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆153Updated 5 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆362Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆87Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- This is the GitHub page for publicly available emotional speech data.☆322Updated 2 years ago
- ☆86Updated 2 years ago
- Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2☆81Updated 3 years ago
- Mel cepstral distortion (MCD) computations in python.☆213Updated 7 years ago
- Tacotron2 with Global Style Tokens☆63Updated 5 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- ☆272Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆135Updated this week
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆216Updated 3 years ago