matteo-convertino / vosk-build-model
How to create your own model for vosk
☆72 · Updated 4 years ago
Alternatives and similar repositories for vosk-build-model
Users who are interested in vosk-build-model are comparing it to the repositories listed below.
- Model for recasing and repunctuating ASR transcripts ☆138 · Updated last year
- 🐸STT integration examples ☆128 · Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 · Updated 6 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆36 · Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 · Updated 2 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool ☆12 · Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling ☆60 · Updated 3 years ago
- ☆38 · Updated 3 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆40 · Updated 3 years ago
- ☆54 · Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆177 · Updated 9 months ago
- ☆37 · Updated 4 months ago
- Linguistic processing for Common Voice ☆57 · Updated last year
- Kaldi based speaker verification ☆47 · Updated 7 years ago
- Adapting your own Language Model for Kaldi ☆63 · Updated 6 years ago
- Coqui Inference Engine ☆41 · Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆214 · Updated last year
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆33 · Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆228 · Updated last month
- ☆22 · Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Updated last year
- Grapheme To Phoneme ☆73 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆325 · Updated 10 months ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 5 years ago
- Server framework for Kaldi ASR Toolkit ☆98 · Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 · Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 · Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas ☆53 · Updated 3 years ago