xinyal / Gan-Speech-Synthesis-Research
This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangxi Province, conducted by Xinya Li and Professor Alan Black at CMU. Code of synthesizer is only available in speech SSH of CMU, while this repository is about the rest of the work, including extract texts, text …
☆17Updated 8 years ago
Alternatives and similar repositories for Gan-Speech-Synthesis-Research:
Users that are interested in Gan-Speech-Synthesis-Research are comparing it to the libraries listed below
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- wavenet vocoder using tensorflow☆27Updated 7 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 6 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- Util code, issues, discussions☆28Updated 6 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Updated 6 years ago
- Network specification and demo☆35Updated 7 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 7 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- working on parallel wavenet☆25Updated 6 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 8 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- ☆31Updated 6 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- ☆45Updated 5 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 6 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- TTS model based on Transformer.☆57Updated 5 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 6 years ago
- ☆51Updated 6 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Mel spectrum based on tacotron2 for melgan speech synthesis☆15Updated last year
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago