alphacep / vosk
VOSK Speech Recognition Toolkit
☆408Updated 2 years ago
Alternatives and similar repositories for vosk:
Users that are interested in vosk are comparing it to the libraries listed below
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,010Updated 6 months ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆340Updated last year
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,076Updated 9 months ago
- Offline speech recognition for Android with Vosk library.☆815Updated last year
- An official git mirror of Kaldi project SVN repo☆52Updated 7 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆133Updated 11 months ago
- 🐸STT integration examples☆126Updated 2 years ago
- Large, modern dataset for speech recognition☆669Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆962Updated this week
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,316Updated 9 months ago
- Dockerfile for kaldi-gstreamer-server.☆289Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆468Updated 5 years ago
- A Python wrapper for Kaldi☆1,010Updated 2 months ago
- FastCGI support for Kaldi ASR☆185Updated 5 years ago
- Phonetisaurus G2P☆466Updated 9 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆528Updated last year
- How to create your own model for vosk☆70Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆308Updated 4 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆613Updated 11 months ago
- Efficient neural speech synthesis☆1,161Updated 6 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆835Updated last year
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆527Updated 11 months ago
- ESPnet Model Zoo☆247Updated last year
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Updated 5 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆280Updated 2 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year