padmalcom / ttsdatasetcreatorLinks
☆22Updated last week
Alternatives and similar repositories for ttsdatasetcreator
Users that are interested in ttsdatasetcreator are comparing it to the libraries listed below
Sorting:
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Tools to create your own voice dataset for TTS training☆68Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆257Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆89Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆160Updated 2 weeks ago
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- ☆45Updated 2 months ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆341Updated 3 years ago
- Community framework for training tortoise☆44Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆369Updated last year
- ☆67Updated 3 months ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆226Updated 2 years ago
- ☆43Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.☆400Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆341Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 4 months ago
- Linguistic processing for Common Voice☆57Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆307Updated 4 years ago