matteo-convertino / vosk-build-model
How to create your own model for vosk
☆72 · Updated 3 years ago
Alternatives and similar repositories for vosk-build-model
Users who are interested in vosk-build-model are comparing it to the libraries listed below.
- Model for recasing and repunctuating ASR transcripts ☆133 · Updated last year
- 🐸STT integration examples ☆129 · Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆174 · Updated 6 months ago
- Kaldi based speaker verification ☆47 · Updated 7 years ago
- ☆38 · Updated 3 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 · Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆115 · Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 · Updated 2 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool ☆12 · Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling ☆60 · Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆218 · Updated this week
- Official home of the Idlak Speech Synthesis Toolkit ☆66 · Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆82 · Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆149 · Updated last year
- An online speech recognition extension toolkit of Kaldi ☆56 · Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆102 · Updated 4 months ago
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 · Updated 5 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands datasets V1 and V2. ☆107 · Updated 2 years ago
- Adapting your own Language Model for Kaldi ☆63 · Updated 6 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements. ☆18 · Updated 6 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆141 · Updated 3 weeks ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆95 · Updated last year
- Voice Activity Detection (VAD) using deep learning. ☆196 · Updated 5 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆206 · Updated 11 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Updated last year
- ☆53 · Updated 2 years ago
- Linguistic processing for Common Voice ☆55 · Updated last year
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 5 years ago
- Articulatory features estimation using Listen Attend and Spell architecture. ☆32 · Updated 5 years ago