asuni / wavelet_prosody_toolkit
☆185Updated 8 months ago
Alternatives and similar repositories for wavelet_prosody_toolkit:
Users that are interested in wavelet_prosody_toolkit are comparing it to the libraries listed below
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆185Updated 4 years ago
- A Python toolbox for speech features extraction☆160Updated last year
- Charsiu: A neural phonetic aligner.☆288Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆286Updated last year
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆168Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆100Updated 2 months ago
- ☆111Updated 2 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆140Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- ☆40Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆140Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Alignment files of LibriTTS.☆61Updated 4 years ago
- Praat textgrid manipulation in Python☆51Updated 11 months ago
- ☆30Updated 4 years ago
- Mel cepstral distortion (MCD) computations in python.☆219Updated 7 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Updated 3 years ago
- This is the GitHub page for publicly available emotional speech data.☆330Updated 3 years ago
- A vocoder framework which had been widely used in research community since 1999.☆178Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆201Updated 4 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆127Updated 6 months ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆113Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago
- Libri-CSS: dataset and evaluation pipeline☆141Updated 2 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆83Updated 5 years ago