KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆26Updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools:
Users that are interested in opensource-voice-tools are comparing it to the libraries listed below
- ☆75Updated 3 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 7 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Code for AccentDB.☆20Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 3 years ago
- Linguistic processing for Common Voice☆55Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Command line tool to create corpora for Common Voice☆75Updated 10 months ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆38Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated last month
- asr2k☆49Updated 10 months ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- Dataset Release for Intent Classification from Speech☆46Updated last month
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality☆21Updated 5 years ago
- ☆17Updated 3 years ago
- Grapheme To Phoneme☆71Updated 8 months ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated last year