matteo-convertino / vosk-build-model
How to create your own model for vosk
☆66Updated 3 years ago
Alternatives and similar repositories for vosk-build-model:
Users that are interested in vosk-build-model are comparing it to the libraries listed below
- Model for recasing and repunctuating ASR transcripts☆132Updated 9 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆203Updated 6 months ago
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Adapting your own Language Model for Kaldi☆64Updated 6 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 3 years ago
- ☆89Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆170Updated last month
- 🐸STT integration examples☆123Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆163Updated 7 months ago
- ☆38Updated 3 years ago
- Kaldi based speaker verification☆47Updated 7 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 8 months ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ☆38Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆129Updated last month
- Grapheme To Phoneme☆70Updated 6 months ago
- A non-native English corpus for pronunciation scoring task☆121Updated 6 months ago
- An official git mirror of Kaldi project SVN repo☆51Updated 5 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 5 years ago
- Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC☆40Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- Open models for Coqui STT☆127Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆94Updated 2 weeks ago