Yeongtae / tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆30 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for tacotron2
- Tacotron2 with Global Style Tokens ☆63 · Updated 5 years ago
- Pronunciation-assisted Subword Modeling ☆29 · Updated 5 years ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆53 · Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ☆51 · Updated 5 years ago
- Implementation of AlignTTS ☆76 · Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 · Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 · Updated last year
- Tacotron2 + LPCNET for a complete end-to-end TTS system ☆93 · Updated last year
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆51 · Updated 4 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2 ☆25 · Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Updated 2 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search ☆82 · Updated 3 years ago
- ☆51 · Updated 5 years ago
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture" ☆115 · Updated 2 years ago
- Efficient neural speech synthesis ☆80 · Updated 3 years ago
- VAE Tacotron 2, an alternative to GST Tacotron ☆87 · Updated last year
- Text frontend for ESPnet TTS recipes ☆31 · Updated 3 years ago
- Implementation of multi-speaker TTS ☆49 · Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks, also known as the Lessac or Blizzard 2013 dataset. ☆44 · Updated 4 years ago
- A PyTorch implementation of FB-MelGAN ☆86 · Updated 4 years ago
- Gaussian Mixture VAE Tacotron ☆53 · Updated last year
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 · Updated 3 years ago
- A system for singing voice synthesis ☆79 · Updated last year
- Alignment files of LibriTTS. ☆59 · Updated 4 years ago
- Scripts to align a given wave to its transcription using models trained with Kaldi ☆32 · Updated 5 years ago
- Streaming attention networks for end-to-end automatic speech recognition ☆55 · Updated 4 years ago
- Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future. ☆154 · Updated 3 years ago
- PyTorch implementation of "Robust and fine-grained prosody control of end-to-end speech synthesis" ☆41 · Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation ☆79 · Updated 3 years ago
- C++ implementation of end-to-end TTS that combines Tacotron2 and the LPCNET vocoder. ☆32 · Updated 5 years ago