alphacep / vosk-apiLinks
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
β14,204Updated 2 months ago
Alternatives and similar repositories for vosk-api
Users that are interested in vosk-api are comparing it to the libraries listed below
Sorting:
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,225Updated 6 months ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,558Updated last year
- Silero Models: pre-trained text-to-speech models made embarrassingly simpleβ5,764Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ8,125Updated last month
- A small speech recognizerβ4,267Updated 3 weeks ago
- Faster Whisper transcription with CTranslate2β20,833Updated 2 months ago
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β6,121Updated 2 weeks ago
- Offline speech recognition for Android with Vosk library.β1,007Updated 2 months ago
- A fast, local neural text to speech systemβ10,533Updated 5 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,112Updated 2 years ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ44,446Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,051Updated this week
- VOSK Speech Recognition Toolkitβ491Updated 3 years ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntimeβ¦β10,226Updated this week
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,722Updated 7 months ago
- Port of OpenAI's Whisper model in C/C++β46,518Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervisionβ94,315Updated last month
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β9,124Updated this week
- Open Text to Speech Serverβ1,119Updated last year
- https://hf.co/hexgrad/Kokoro-82Mβ5,574Updated 6 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β15,322Updated 4 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,526Updated 2 months ago
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,157Updated last year
- Production First and Production Ready End-to-End Speech Recognition Toolkitβ5,024Updated last month
- Open-source offline translation library written in Pythonβ5,633Updated last week
- A PyTorch-based Speech Toolkitβ11,181Updated this week
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,926Updated 7 months ago
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure javaβ2,564Updated last year
- Converts text to speech in realtimeβ3,750Updated 3 weeks ago