Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Updated last year
Related projects
Alternatives and complementary repositories for Transformer-TTS
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- A PyTorch implementation of FB-MelGAN☆86Updated 4 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆166Updated last year
- ☆51Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- Gaussian Mixture VAE Tacotron☆53Updated last year
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆15Updated 3 years ago
- Tacotron2 with Global Style Tokens☆63Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"☆115Updated 2 years ago
- A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech☆128Updated 4 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆133Updated 4 years ago
- Implementation of Multi speaker TTS☆49Updated 3 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Google's TPGST reimplementation.☆34Updated 4 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"☆45Updated 2 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- Follows NVIDIA's implementation, simplifies it, and supports data parallelism.☆13Updated 5 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆72Updated 3 years ago