alphacep / vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
☆929Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for vosk-server
- VOSK Speech Recognition Toolkit☆383Updated 2 years ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆8,137Updated last week
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- Examples of how to use or integrate DeepSpeech☆821Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆941Updated 2 months ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆498Updated 7 months ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,073Updated 5 months ago
- Dockerfile for kaldi-gstreamer-server.☆288Updated 2 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆382Updated 10 months ago
- On-device streaming speech-to-text engine powered by deep learning☆594Updated 2 weeks ago
- Large, modern dataset for speech recognition☆646Updated 8 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- Offline speech recognition for Android with Vosk library.☆755Updated 11 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- 🐸 collection of TTS papers☆640Updated 4 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,283Updated 5 months ago
- Python interface to the WebRTC Voice Activity Detector☆2,068Updated 4 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆469Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,284Updated 8 months ago
- Command line utility for forced alignment using Kaldi☆1,344Updated last week
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆755Updated last week
- A Python wrapper for Kaldi☆999Updated 3 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆285Updated this week
- ☆934Updated last week
- Open-Source Large Vocabulary Continuous Speech Recognition Engine☆1,844Updated 6 months ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆355Updated last year
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago