ZackHodari / average_prosody
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Workshop
☆23Updated 4 years ago
Related projects: ⓘ
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- Interspeech 2019 tutorial materials☆48Updated 4 years ago
- ☆34Updated 5 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆36Updated 4 years ago
- ☆26Updated 3 years ago
- ☆16Updated this week
- using world vocoder to extract features and make data for training neural networks☆11Updated 6 years ago
- ☆51Updated 5 years ago
- ☆23Updated this week
- Google's TPGST reimplementation.☆34Updated 4 years ago
- an tutorial implement of voice conversion using pytorch☆35Updated 6 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- An evaluation toolkit for voice conversion models.☆39Updated 3 years ago
- ☆22Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆84Updated 4 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆38Updated 4 years ago
- Speech enhancement using mimic loss☆15Updated 4 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- ☆10Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆35Updated 4 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 4 years ago
- ☆45Updated 4 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago