alphacep / vosk-serverLinks

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

☆1,138

Alternatives and similar repositories for vosk-server

Users that are interested in vosk-server are comparing it to the libraries listed below

Sorting:

alphacep / vosk
VOSK Speech Recognition Toolkit
☆458Updated 3 years ago
mozilla / DeepSpeech-examples
Examples of how to use or integrate DeepSpeech
☆854Updated 2 years ago
alumae / kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,086Updated last year
wiseman / py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
☆2,311Updated last year
alphacep / vosk-android-demo
Offline speech recognition for Android with Vosk library.
☆914Updated last year
MycroftAI / mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
☆916Updated last year
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,350Updated last year
TensorSpeech / TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…
☆987Updated last month
ccoreilly / vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
☆469Updated last year
coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,487Updated last year
Tomiinek / Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆838Updated last year
dpirch / libfvad
Voice activity detection (VAD) library, based on WebRTC's VAD engine
☆551Updated last year
Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
☆634Updated 2 weeks ago
gooofy / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆446Updated 4 years ago
k2-fsa / icefall
☆1,187Updated 2 weeks ago
k2-fsa / sherpa
Speech-to-text server framework with next-gen Kaldi
☆750Updated last week
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,236Updated 3 weeks ago
jcsilva / docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server.
☆289Updated 3 years ago
synesthesiam / opentts
Open Text to Speech Server
☆1,085Updated last year
wenet-e2e / wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆983Updated last month
seasalt-ai / snowboy
DNN based hotword and wake word detection toolkit (model generation included)
☆476Updated 4 years ago
pykaldi / pykaldi
A Python wrapper for Kaldi
☆1,022Updated 6 months ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,553Updated this week
YoavRamon / awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
☆537Updated 3 years ago
Picovoice / picovoice
On-device voice assistant platform powered by deep learning
☆657Updated 3 months ago
cmusphinx / pocketsphinx
A small speech recognizer
☆4,163Updated last week
daanzu / kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
☆342Updated 2 years ago
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,387Updated 5 months ago
MycroftAI / mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …
☆513Updated 2 years ago
athena-team / athena
an open-source implementation of sequence-to-sequence based speech processing engine
☆953Updated 2 years ago