alphacep / vosk
VOSK Speech Recognition Toolkit
☆396 · Updated 2 years ago
Alternatives and similar repositories for vosk:
Users who are interested in vosk compare it to the libraries listed below.
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries ☆989 · Updated 5 months ago
- Open tools and data for cloudless automatic speech recognition ☆447 · Updated 3 years ago
- An official git mirror of Kaldi project SVN repo ☆51 · Updated 5 months ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆339 · Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆535 · Updated 3 years ago
- Offline speech recognition for Android with Vosk library. ☆790 · Updated last year
- Model for recasing and repunctuating ASR transcripts ☆133 · Updated 10 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 · Updated 3 years ago
- A Python wrapper for Kaldi ☆1,006 · Updated 3 weeks ago
- 🐸STT integration examples ☆125 · Updated 2 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,308 · Updated 8 months ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 · Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ☆505 · Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆467 · Updated 4 years ago
- DeepSpeech based forced alignment tool ☆237 · Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow ☆358 · Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ☆956 · Updated this week
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ☆478 · Updated 3 years ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender … ☆778 · Updated last month
- Examples of how to use or integrate DeepSpeech ☆836 · Updated last year
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆291 · Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion ☆836 · Updated 2 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk ☆407 · Updated last year
- An audio/acoustic activity detection and audio segmentation tool ☆765 · Updated 2 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al. ☆582 · Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ☆366 · Updated 2 months ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for… ☆153 · Updated 8 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages ☆604 · Updated 9 months ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 · Updated 5 months ago
- Simple text to phones converter for multiple languages ☆1,333 · Updated 4 months ago