yweweler / single-speaker-tts
This is a single-speaker neural text-to-speech (TTS) system that can be trained end-to-end. It is inspired by the Tacotron architecture and can be trained on unaligned text-audio pairs.
☆13 Updated 7 years ago
Alternatives and similar repositories for single-speaker-tts
Users who are interested in single-speaker-tts are comparing it to the repositories listed below.
- A PyTorch implementation of the universal neural vocoder ☆67 Updated 5 years ago
- A PyTorch implementation of the FB-MelGAN ☆90 Updated 5 years ago
- ☆90 Updated 4 years ago
- Voice conversion (VC) investigation using three variants of VAE ☆59 Updated 6 years ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆121 Updated 11 months ago
- A PyTorch Implementation of MelGAN ☆66 Updated 6 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆55 Updated 4 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆83 Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2 ☆26 Updated 5 years ago
- Fast SpecAugmentation code with NumPy and SciPy ☆31 Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN ☆110 Updated 3 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 Updated last year
- Voice Conversion Challenge 2020 CycleVAE baseline system ☆131 Updated 5 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. ☆73 Updated 4 years ago
- Multilingual speech aligner ☆76 Updated 2 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/ ☆34 Updated 2 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks, also known as the Lessac or Blizzard 2013 dataset. ☆45 Updated 6 years ago
- Implementation of the AlignTTS ☆77 Updated 2 years ago
- Official implementation of BVAE-TTS ☆173 Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ☆75 Updated 4 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis" ☆192 Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 Updated 3 years ago
- Unofficial PyTorch Implementation of WaveGrad2 ☆112 Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆49 Updated 4 years ago
- Speech (audio) subjective evaluation system ☆42 Updated 5 years ago
- ☆53 Updated 5 years ago
- ☆67 Updated 7 months ago