padmalcom / ttsdatasetcreatorLinks
☆22Updated 3 weeks ago
Alternatives and similar repositories for ttsdatasetcreator
Users that are interested in ttsdatasetcreator are comparing it to the libraries listed below
Sorting:
- Tools to create your own voice dataset for TTS training☆67Updated 5 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- Text to Speech for Indic languages☆52Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆372Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- 🐸STT integration examples☆129Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 5 years ago
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆259Updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆161Updated 2 weeks ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Community framework for training tortoise☆44Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆89Updated last year
- Grapheme to phoneme conversion with deep learning.☆404Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Updated 3 years ago
- Linguistic processing for Common Voice☆57Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆226Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆51Updated 4 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆309Updated 4 years ago
- ☆67Updated 4 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- ☆44Updated 2 years ago