matteo-convertino / vosk-build-modelLinks
How to create your own model for vosk
β75Updated 4 years ago
Alternatives and similar repositories for vosk-build-model
Users that are interested in vosk-build-model are comparing it to the libraries listed below
Sorting:
- Model for recasing and repunctuating ASR transcriptsβ140Updated last year
- πΈSTT integration examplesβ129Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β326Updated 11 months ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 6 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ36Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β213Updated last year
- Linguistic processing for Common Voiceβ57Updated last year
- Grapheme To Phonemeβ73Updated last year
- Python server for communicating with Kaldi from the browser using WebRTCβ69Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpusβ177Updated 10 months ago
- Official home of the Idlak Speech Synthesis Toolkitβ67Updated 3 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ40Updated 3 years ago
- This repositoryβ30Updated 2 years ago
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Updated 4 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago
- Evaluate results from ASR/Speech-to-Text quicklyβ39Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ119Updated 2 years ago
- Coqui Inference Engineβ41Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB databaseβ35Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ232Updated last month
- Support tools for punctuation and boundary detection for ASR output.β56Updated 2 years ago
- Silence detection in audio stream using webrtcvadβ49Updated last year
- Persian Consonant Vowel Combination (PCVC) Speech Datasetβ19Updated 6 months ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 3 years ago
- Adapting your own Language Model for Kaldiβ63Updated 6 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- An official git mirror of Kaldi project SVN repoβ55Updated last year
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 3 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago