ishandutta2007 / Text-to-Speech-Landscape
☆45Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Text-to-Speech-Landscape
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆97Updated 2 years ago
- ☆25Updated 2 years ago
- Long audio alignment using Kaldi☆25Updated 3 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆22Updated 5 years ago
- Deep Convolution Text to Speech☆35Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- Tensorflow Implementation of Expressive Tacotron☆197Updated 6 years ago
- ☆21Updated 6 years ago
- ☆255Updated last year
- asr2k☆48Updated 5 months ago
- A Collection of Speech Corpus for ASR and TTS☆112Updated 7 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- Implementation of Multi speaker TTS☆49Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated 10 months ago
- style token with tacotron2☆61Updated last year
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year