coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆87Updated 2 years ago
Alternatives and similar repositories for TTS-recipes
Users that are interested in TTS-recipes are comparing it to the libraries listed below
Sorting:
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- ☆80Updated 11 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆243Updated 5 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- ☆258Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆160Updated last year
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech from text☆48Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Python library for handling audio datasets.☆137Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- Labeled data for homograph disambiguation☆57Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- Desktop application for neural speech synthesis written in C++☆215Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)☆272Updated 3 years ago