alphacep / vosk-apiLinks
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
β13,559Updated 2 weeks ago
Alternatives and similar repositories for vosk-api
Users that are interested in vosk-api are comparing it to the libraries listed below
Sorting:
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,194Updated 3 months ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,529Updated last year
- Faster Whisper transcription with CTranslate2β18,859Updated last week
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β5,778Updated last week
- Silero Models: pre-trained text-to-speech models made embarrassingly simpleβ5,542Updated last week
- Offline speech recognition for Android with Vosk library.β959Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ7,238Updated last week
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntimeβ¦β8,708Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β18,582Updated 2 weeks ago
- Recurrent neural network for audio noise reductionβ5,119Updated 8 months ago
- VOSK Speech Recognition Toolkitβ480Updated 3 years ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,038Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,420Updated 2 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β8,610Updated 2 weeks ago
- End-to-End Speech Processing Toolkitβ9,562Updated this week
- OpenAI Whisper ASR Webservice APIβ2,985Updated 4 months ago
- Examples of how to use or integrate DeepSpeechβ857Updated 2 years ago
- A python package to analyze and compare voices with deep learningβ3,141Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β26,640Updated 4 months ago
- Python interface to the WebRTC Voice Activity Detectorβ2,383Updated last year
- A small speech recognizerβ4,214Updated last week
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure javaβ2,549Updated 9 months ago
- Open Text to Speech Serverβ1,110Updated last year
- On-device wake word detection powered by deep learningβ4,475Updated 2 weeks ago
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, β¦β1,536Updated 2 weeks ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ43,337Updated last year
- A PyTorch-based Speech Toolkitβ10,683Updated this week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ1,948Updated 3 weeks ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,647Updated last month
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,913Updated 4 months ago