Edresson / TTSLinks
Deep learning for Text to Speech
☆27Updated 4 years ago
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- Open Source Text-To-Speech Portuguese Dataset☆172Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated 2 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- This repository is a collection of TTS Models in TFLite☆195Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 6 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆338Updated 3 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆87Updated 4 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆72Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 11 months ago
- A sequence-to-sequence voice conversion toolkit.☆101Updated last year
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆89Updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆168Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆249Updated 3 years ago