padmalcom / ttsdatasetcreator
☆23Updated last year
Related projects:
- Tools to create your own voice dataset for TTS training☆58Updated 3 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆85Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆95Updated last year
- A repository with comprehensive instructions for using the Festvox toolkit to generate emotional speech from text☆45Updated last year
- Interface for Controllable Expressive Talking Machine☆37Updated 8 months ago
- Text to Speech for Indic languages☆49Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆25Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆151Updated last month
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆69Updated 2 years ago
- Linguistic processing for Common Voice☆50Updated 8 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆55Updated last year
- Flask+Tornado-based NVIDIA Tacotron2+WaveGlow TTS web app☆28Updated last year
- Create an LJSpeech-structured voice dataset from WAV input☆16Updated 2 months ago
- Forced Alignments for Common Voice☆29Updated 3 years ago
- Web app for keyword spotting using TensorFlow.js☆69Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆135Updated 9 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆29Updated 7 months ago
- A speaker embedding network in PyTorch that is very quick to set up and use for any purpose.☆82Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆46Updated last year