coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆84 Updated 2 years ago
Related projects
Alternatives and complementary repositories for TTS-recipes
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆106 Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 Updated last year
- Simple text to phonemes converter for multiple languages ☆20 Updated last year
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆128 Updated 3 years ago
- ☆77 Updated 5 months ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆265 Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆279 Updated 4 months ago
- Pytorch implementation of Deepmind's WaveRNN model ☆120 Updated 5 years ago
- Python library for handling audio datasets. ☆131 Updated last year
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆169 Updated 3 months ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ☆331 Updated 2 years ago
- ☆74 Updated 3 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ☆283 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆98 Updated last year
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation ☆190 Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆337 Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆24 Updated 3 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ☆140 Updated 10 months ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆158 Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech ☆224 Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ☆189 Updated 2 years ago
- ☆251 Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆143 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆83 Updated 3 weeks ago
- Awesome list of TTS papers with audio samples ☆60 Updated 4 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆280 Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆115 Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech ☆241 Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆87 Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network ☆319 Updated 3 months ago