padmalcom / ttsdatasetcreatorLinks
☆22Updated 2 years ago
Alternatives and similar repositories for ttsdatasetcreator
Users that are interested in ttsdatasetcreator are comparing it to the libraries listed below
Sorting:
- Tools to create your own voice dataset for TTS training☆65Updated 4 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated last week
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- ☆258Updated 2 years ago
- ☆42Updated 3 years ago
- ☆38Updated 3 years ago
- ☆43Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆51Updated last week
- ☆67Updated 5 months ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 3 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- ☆76Updated 3 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆335Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆60Updated 2 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- Unofficial implementation of miipher☆125Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆16Updated 5 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆35Updated last year