alphacep / vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
β9,401Updated last week
Alternatives and similar repositories for vosk-api:
Users that are interested in vosk-api are comparing it to the libraries listed below
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,040Updated 8 months ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,425Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ5,703Updated last month
- Offline speech recognition for Android with Vosk library.β839Updated last year
- On-device wake word detection powered by deep learningβ4,087Updated this week
- Python interface to the WebRTC Voice Activity Detectorβ2,225Updated 10 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,819Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β8,710Updated 3 weeks ago
- End-to-End Speech Processing Toolkitβ9,053Updated last week
- VOSK Speech Recognition Toolkitβ411Updated 2 years ago
- Examples of how to use or integrate DeepSpeechβ845Updated last year
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,924Updated 10 months ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ5,260Updated last year
- Offline Text To Speech synthesis for pythonβ2,314Updated 4 months ago
- An Open Source text-to-speech system built by inverting Whisper.β4,234Updated last month
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,846Updated 4 months ago
- Faster Whisper transcription with CTranslate2β15,776Updated last week
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,026Updated 3 weeks ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β968Updated this week
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,816Updated last week
- A lightweight, simple-to-use, RNN wake word listenerβ894Updated last year
- A small speech recognizerβ4,109Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ39,821Updated 8 months ago
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure javaβ2,463Updated 3 months ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.β1,081Updated 11 months ago
- Open Text to Speech Serverβ1,041Updated last year
- End to end text to speech system using gruut and onnxβ829Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β7,414Updated last week
- On-device streaming speech-to-text engine powered by deep learningβ622Updated this week
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,309Updated 8 months ago