padmalcom / ttsdatasetcreator
☆23Updated 2 years ago
Alternatives and similar repositories for ttsdatasetcreator:
Users that are interested in ttsdatasetcreator are comparing it to the libraries listed below
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆65Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 8 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆171Updated last month
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆168Updated 4 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- ☆79Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆157Updated this week
- A python library to generate speech dataset from Youtube videos☆36Updated 7 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- speaker diarization system using an LSTM☆49Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆69Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago
- ☆43Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆209Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- ☆74Updated 3 years ago
- 🐸STT integration examples☆123Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆25Updated 2 years ago