Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Updated last year
Alternatives and similar repositories for Transformer-TTS:
Users who are interested in Transformer-TTS are comparing it to the repositories listed below
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆52Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- Implementation of the AlignTTS☆76Updated last year
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- ☆51Updated 6 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- A PyTorch implementation of the FB-MelGAN☆89Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- ☆45Updated 5 years ago
- An evaluation toolkit for voice conversion models.☆41Updated 3 years ago
- Pytorch implementation of "EfficientTTS: an efficient and high-quality text-to-speech architecture"☆116Updated 3 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Implementation of Multi speaker TTS☆51Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Updated 5 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 6 years ago
- ☆34Updated 5 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Updated 3 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆132Updated 3 years ago