alphacep / voskLinks
VOSK Speech Recognition Toolkit
☆445Updated 2 years ago
Alternatives and similar repositories for vosk
Users that are interested in vosk are comparing it to the libraries listed below
Sorting:
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,103Updated 3 weeks ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Model for recasing and repunctuating ASR transcripts☆133Updated last year
- Examples of how to use or integrate DeepSpeech☆852Updated last year
- Offline speech recognition for Android with Vosk library.☆886Updated last year
- Simple text to phones converter for multiple languages☆1,396Updated 8 months ago
- A Python wrapper for Kaldi☆1,017Updated 4 months ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆342Updated last year
- 🐸STT integration examples☆129Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆356Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆445Updated 5 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆836Updated last year
- An audio/acoustic activity detection and audio segmentation tool☆781Updated 6 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆536Updated 3 years ago
- An official git mirror of Kaldi project SVN repo☆54Updated 9 months ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆742Updated 4 months ago
- Speech-to-text server framework with next-gen Kaldi☆709Updated this week
- A small fast portable speech synthesis system☆975Updated 11 months ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆454Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆473Updated 5 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,336Updated last year
- g2p: English Grapheme To Phoneme Conversion☆859Updated 2 years ago
- A lightweight, simple-to-use, RNN wake word listener☆904Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆316Updated 7 months ago
- ☆1,141Updated this week
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆584Updated 3 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆631Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆509Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago