alphacep / vosk-apiLinks
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
β12,666Updated 2 months ago
Alternatives and similar repositories for vosk-api
Users that are interested in vosk-api are comparing it to the libraries listed below
Sorting:
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,125Updated last month
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,465Updated last year
- A small speech recognizerβ4,149Updated this week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,898Updated last year
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,380Updated last year
- Offline speech recognition for Android with Vosk library.β899Updated last year
- Examples of how to use or integrate DeepSpeechβ852Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ6,254Updated last month
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,271Updated last week
- Port of OpenAI's Whisper model in C/C++β41,410Updated last week
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,518Updated 3 weeks ago
- Real time transcription with OpenAI Whisper.β2,778Updated 2 months ago
- A nearly-live implementation of OpenAI's Whisper.β3,080Updated last week
- Faster Whisper transcription with CTranslate2β16,978Updated last month
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,895Updated 3 weeks ago
- A PyTorch-based Speech Toolkitβ10,113Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ41,332Updated 10 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntimeβ¦β6,651Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,096Updated 2 weeks ago
- An Open Source text-to-speech system built by inverting Whisper.β4,308Updated last month
- Offline Text To Speech synthesis for pythonβ2,368Updated this week
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.β14,806Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β16,714Updated last week
- OpenAI Whisper ASR Webservice APIβ2,722Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervisionβ84,679Updated 2 weeks ago
- https://hf.co/hexgrad/Kokoro-82Mβ3,484Updated last week
- Open-source offline translation library written in Pythonβ4,600Updated this week
- On-device wake word detection powered by deep learningβ4,253Updated this week
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β7,854Updated last week
- End-to-End Speech Processing Toolkitβ9,279Updated last week