matteo-convertino / vosk-build-model
How to create your own model for vosk
☆72 Updated 4 years ago
Alternatives and similar repositories for vosk-build-model
Users who are interested in vosk-build-model are comparing it to the libraries listed below.
- Model for recasing and repunctuating ASR transcripts ☆137 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆323 Updated 9 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆36 Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆227 Updated 2 weeks ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆213 Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆176 Updated 8 months ago
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 Updated 6 years ago
- ☆38 Updated 3 years ago
- Linguistic processing for Common Voice ☆57 Updated last year
- ☆22 Updated 4 years ago
- Python wrapper for the Phonetisaurus grapheme-to-phoneme tool ☆12 Updated 4 years ago
- ☆54 Updated 2 years ago
- 🐸STT integration examples ☆129 Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆150 Updated last year
- Grapheme To Phoneme ☆73 Updated last year
- Server framework for Kaldi ASR Toolkit ☆97 Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆363 Updated last year
- ☆40 Updated last year
- 🐸TTS recipes for different datasets ☆86 Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆115 Updated 6 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆170 Updated last year
- Python speaker diarization system based on binary key speaker modelling ☆60 Updated 3 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences. ☆168 Updated 3 months ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 Updated 4 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 Updated 2 years ago
- This repository is a collection of TTS Models in TFLite ☆199 Updated 4 years ago