VOSK Speech Recognition Toolkit
☆493Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for vosk
Users that are interested in vosk are comparing it to the libraries listed below
Sorting:
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,233Jul 25, 2025Updated 7 months ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆14,301Feb 22, 2026Updated last week
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Offline speech recognition for Android with Vosk library.☆1,014Dec 8, 2025Updated 2 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Website and documentation☆22Jan 7, 2026Updated last month
- Tacotron2 + Waveglow Russian☆43Jan 11, 2020Updated 6 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆506Dec 7, 2025Updated 2 months ago
- Adapting your own Language Model for Kaldi☆63Jan 8, 2019Updated 7 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- transcribe audio feeds into public web ui☆45Aug 31, 2022Updated 3 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- How to create your own model for vosk☆75Aug 14, 2021Updated 4 years ago
- Open STT☆818Mar 11, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆12Jul 18, 2025Updated 7 months ago
- Phonetisaurus G2P☆506Jun 1, 2024Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- bash script for access to Yandex SpeechKit longRunningRecognize☆15Jan 27, 2023Updated 3 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 7 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- Open tools and data for cloudless automatic speech recognition☆446Mar 30, 2021Updated 4 years ago
- Online streaming speaker change detection model in Pytorch☆44Apr 14, 2023Updated 2 years ago
- Some tutorials used for ASR class☆31Jul 20, 2021Updated 4 years ago
- Tools for ASR Corpus Generation from Online Video☆140Feb 10, 2019Updated 7 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated last year