coqui-ai / TTS-recipes
๐ธTTS recipes for different datasets
โ84Updated 2 years ago
Related projects โ
Alternatives and complementary repositories for TTS-recipes
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.โ100Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelโ106Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020โ265Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglowโ128Updated 3 years ago
- Tools to create your own voice dataset for TTS trainingโ61Updated 4 years ago
- This repository is a collection of TTS Models in TFLiteโ189Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"โ237Updated 4 years ago
- A python library to generate speech dataset from Youtube videosโ35Updated 5 months ago
- Python library for handling audio datasets.โ131Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisperโ99Updated last year
- โ251Updated last year
- Pytorch implementation of Deepmind's WaveRNN modelโ121Updated 5 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoderโ170Updated 3 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksโ62Updated 4 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.โ285Updated this week
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingโ189Updated 3 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Textโ235Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesโ144Updated last year
- โ74Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xโ71Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)โ138Updated last year
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllablโฆโ158Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseโ84Updated last month
- ๐ธSTT integration examplesโ121Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesโ130Updated 7 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, โฆโ283Updated last year
- Collect Voice Conversion researchesโ90Updated this week
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechโ331Updated 2 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accentโ72Updated 2 years ago
- Command line tool to create corpora for Common Voiceโ75Updated 5 months ago