nii-yamagishilab / self-attention-tacotron
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
☆114Updated 4 years ago
Alternatives and similar repositories for self-attention-tacotron:
Users that are interested in self-attention-tacotron are comparing it to the libraries listed below
- ☆51Updated 5 years ago
- parallel wavenet based on nsynth☆107Updated 6 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- A pytroch implementation of the FB-MelGAN☆88Updated 4 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆168Updated last year
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆126Updated 5 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- ☆42Updated 6 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆79Updated 5 years ago
- Code to train and run Blow☆143Updated 5 years ago
- ☆64Updated last year
- Implementation of the AlignTTS☆76Updated last year
- style token with tacotron2☆61Updated last year
- ☆91Updated 3 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 4 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- Implementation of voice conversion system utilizing phonetic posteriorgrams (status: archive)☆81Updated 4 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆92Updated 6 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Voice conversion (VC) investigation using three variants of VAE☆57Updated 5 years ago
- A system works on singing voice synthesis☆79Updated 2 years ago