kaituoxu / Tacotron2Links
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
☆52Updated 6 years ago
Alternatives and similar repositories for Tacotron2
Users that are interested in Tacotron2 are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- wavenet vocoder using tensorflow☆26Updated 7 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- Interspeech 2019 tutorial materials☆49Updated 5 years ago
- ☆45Updated 5 years ago
- ☆13Updated 6 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- ☆51Updated 6 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 2 years ago
- ☆31Updated 6 years ago
- VoxSRC Challenge☆31Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- Tensorflow implementation of Nvidia Waveglow☆41Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- ☆56Updated 6 years ago
- Core code for my ICASSP 2018 paper☆53Updated 7 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆53Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 5 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion☆41Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- TTS model based on Transformer.☆58Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 5 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Updated 5 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Updated 2 years ago