yweweler / single-speaker-ttsLinks
This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron archicture and able to train based on unaligned text-audio pairs.
☆13Updated 6 years ago
Alternatives and similar repositories for single-speaker-tts
Users that are interested in single-speaker-tts are comparing it to the libraries listed below
Sorting:
- A Pytorch Implementation of MelGAN☆65Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆54Updated 5 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Updated 5 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 5 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 7 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
- A pytroch implementation of the FB-MelGAN☆90Updated 5 years ago
- Implementation of the AlignTTS☆77Updated 2 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 3 years ago
- multilingual speech aligner☆77Updated 2 years ago
- ☆90Updated 4 years ago
- ☆42Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Updated 4 years ago
- ☆23Updated 8 years ago
- ☆163Updated 3 years ago
- A system works on singing voice synthesis☆79Updated 2 years ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆88Updated 4 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆75Updated 4 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated 2 years ago
- VQVAE for Unsupervised Voice Conversion☆21Updated 6 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 6 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Updated 6 years ago
- Authors' implementation of DeepSpeech Distances.☆130Updated 5 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Updated 4 years ago