WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
☆1,233Jul 25, 2025Updated 7 months ago
Alternatives and similar repositories for vosk-server
Users that are interested in vosk-server are comparing it to the libraries listed below
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆14,301Feb 22, 2026Updated last week
- VOSK Speech Recognition Toolkit☆493Jul 13, 2022Updated 3 years ago
- Speech Recognition in Asterisk with Vosk Server☆128Jun 21, 2024Updated last year
- Offline speech recognition for Android with Vosk library.☆1,014Dec 8, 2025Updated 2 months ago
- Server framework for Kaldi ASR Toolkit☆98Sep 17, 2023Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Feb 9, 2022Updated 4 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆346Nov 3, 2025Updated 4 months ago
- Open tools and data for cloudless automatic speech recognition☆446Mar 30, 2021Updated 4 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 3 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.☆1,093Jun 8, 2024Updated last year
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,331Sep 22, 2025Updated 5 months ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆31Jul 20, 2022Updated 3 years ago
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,306Nov 19, 2025Updated 3 months ago
- FastCGI support for Kaldi ASR☆185Apr 5, 2019Updated 6 years ago
- Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC☆44May 16, 2022Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- Adapting your own Language Model for Kaldi☆63Jan 8, 2019Updated 7 years ago
- Dockerfile for kaldi-gstreamer-server.☆291Apr 11, 2022Updated 3 years ago
- Text To Speech Synthesis with Vosk☆248Jan 12, 2026Updated last month
- A Python wrapper for Kaldi☆1,030Nov 30, 2025Updated 3 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆175Aug 9, 2023Updated 2 years ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,279Feb 24, 2026Updated last week
- Python server for communicating with Kaldi from the browser using WebRTC☆69Sep 26, 2023Updated 2 years ago
- Model for recasing and repunctuating ASR transcripts☆143Apr 10, 2024Updated last year
- Kaldi model converter to ONNX☆247Jan 27, 2023Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆184Oct 13, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆506Dec 7, 2025Updated 2 months ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆175Dec 16, 2025Updated 2 months ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆292Aug 5, 2021Updated 4 years ago