alphacep/vosk-api

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alphacep/vosk-api)

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

☆14,937

Alternatives and similar repositories for vosk-api

Users that are interested in vosk-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alphacep / vosk-server
View on GitHub
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
☆1,258Jul 25, 2025Updated 11 months ago
alphacep / vosk-android-demo
View on GitHub
Offline speech recognition for Android with Vosk library.
☆1,053Dec 8, 2025Updated 7 months ago
mozilla / DeepSpeech
View on GitHub
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,770Jun 19, 2025Updated last year
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,425Sep 22, 2025Updated 9 months ago
alphacep / vosk
View on GitHub
VOSK Speech Recognition Toolkit
☆500Jul 13, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆104,926Apr 15, 2026Updated 3 months ago
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,750Aug 16, 2024Updated last year
ggml-org / whisper.cpp
View on GitHub
Port of OpenAI's Whisper model in C/C++
☆51,802Updated this week
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,276Nov 19, 2025Updated 7 months ago
k2-fsa / sherpa-onnx
View on GitHub
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,558Updated this week
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,584Jul 3, 2026Updated last week
coqui-ai / STT
View on GitHub
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,591Mar 11, 2024Updated 2 years ago
snakers4 / silero-models
View on GitHub
Silero Models: pre-trained text-to-speech models made embarrassingly simple
☆6,011Jun 4, 2026Updated last month
ccoreilly / vosk-browser
View on GitHub
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
☆525Dec 7, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
speechbrain / speechbrain
View on GitHub
A PyTorch-based Speech Toolkit
☆11,683Jun 15, 2026Updated last month
cmusphinx / pocketsphinx
View on GitHub
A small speech recognizer
☆4,324Jun 29, 2026Updated 2 weeks ago
rhasspy / piper
View on GitHub
A fast, local neural text to speech system
☆11,221Aug 26, 2025Updated 10 months ago
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,066Updated this week
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,889Updated this week
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,235Updated this week
PaddlePaddle / PaddleSpeech
View on GitHub
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…
☆12,644Jun 21, 2026Updated 3 weeks ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,259Jun 9, 2026Updated last month
Picovoice / porcupine
View on GitHub
On-device wake word detection powered by deep learning
☆4,889Jul 3, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Uberi / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆8,971Jun 16, 2026Updated 3 weeks ago
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,196Aug 19, 2024Updated last year
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,936Apr 19, 2025Updated last year
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,023Mar 9, 2026Updated 4 months ago
ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆120,346Updated this week
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,277Updated this week
NVIDIA-NeMo / Speech
View on GitHub
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…
☆17,770Updated this week
ollama / ollama
View on GitHub
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,107Updated this week
espeak-ng / espeak-ng
View on GitHub
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
☆6,643Jun 29, 2026Updated 2 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
wenet-e2e / wenet
View on GitHub
Production First and Production Ready End-to-End Speech Recognition Toolkit
☆5,167Jun 15, 2026Updated last month
mudler / LocalAI
View on GitHub
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
☆47,537Updated this week
tauri-apps / tauri
View on GitHub
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.
☆109,048Updated this week
mozilla / TTS
View on GitHub
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10,161Nov 9, 2023Updated 2 years ago
rustdesk / rustdesk
View on GitHub
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
☆118,225Updated this week
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,860Nov 19, 2024Updated last year
flashlight / wav2letter
View on GitHub
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,440Updated this week