alphacep / vosk
VOSK Speech Recognition Toolkit
☆383Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for vosk
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆922Updated 2 months ago
- Model for recasing and repunctuating ASR transcripts☆129Updated 6 months ago
- Offline speech recognition for Android with Vosk library.☆754Updated 11 months ago
- 🐸STT integration examples☆121Updated 2 years ago
- Examples of how to use or integrate DeepSpeech☆821Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆338Updated last year
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆379Updated 9 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆464Updated 4 years ago
- DeepSpeech based forced alignment tool☆233Updated 3 years ago
- Efficient neural speech synthesis☆1,140Updated last month
- An official git mirror of Kaldi project SVN repo☆51Updated 2 months ago
- Simple text to phones converter for multiple languages☆1,230Updated last month
- Python interface to the WebRTC Voice Activity Detector☆2,060Updated 4 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆279Updated 4 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆326Updated 9 months ago
- On-device streaming speech-to-text engine powered by deep learning☆590Updated this week
- Large, modern dataset for speech recognition☆644Updated 8 months ago
- Festival Speech Synthesis System☆396Updated last year
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,281Updated 5 months ago
- A Python wrapper for Kaldi☆999Updated 2 months ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆942Updated 2 months ago
- ESPnet Model Zoo☆245Updated last year
- Phonetisaurus G2P☆449Updated 5 months ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- On-device speech-to-text engine powered by deep learning☆428Updated this week
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆510Updated last year
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆580Updated 3 years ago