xinyal / Gan-Speech-Synthesis-ResearchLinks
This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangxi Province, conducted by Xinya Li and Professor Alan Black at CMU. Code of synthesizer is only available in speech SSH of CMU, while this repository is about the rest of the work, including extract texts, text …
☆17Updated 9 years ago
Alternatives and similar repositories for Gan-Speech-Synthesis-Research
Users that are interested in Gan-Speech-Synthesis-Research are comparing it to the libraries listed below
Sorting:
- pytorch implementation of lyre.ai's char2wav model☆32Updated 8 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 7 years ago
- ☆56Updated 7 years ago
- ☆31Updated 7 years ago
- Network specification and demo☆35Updated 8 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆94Updated 7 years ago
- Deep CNN networks for Speech Synthesis☆49Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 6 years ago
- Mel spectrum based on tacotron2 for melgan speech synthesis☆15Updated 2 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- ☆13Updated 7 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- Core code for my ICASSP 2018 paper☆53Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 6 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- TTS model based on Transformer.☆58Updated 6 years ago
- Voice conversion tools for STRAIGHT☆29Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆36Updated 7 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 5 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Updated 7 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- An implementation of Tacotron and Tacotron2☆80Updated 4 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 6 years ago
- Interspeech 2019 tutorial materials☆49Updated 6 years ago