alphacep / vosk
VOSK Speech Recognition Toolkit
☆390Updated 2 years ago
Alternatives and similar repositories for vosk:
Users that are interested in vosk are comparing it to the libraries listed below
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆967Updated 4 months ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆132Updated 9 months ago
- 🐸STT integration examples☆122Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆295Updated 2 months ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- Offline speech recognition for Android with Vosk library.☆773Updated last year
- An official git mirror of Kaldi project SVN repo☆51Updated 4 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆468Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,298Updated 7 months ago
- Large, modern dataset for speech recognition☆656Updated 10 months ago
- Examples of how to use or integrate DeepSpeech☆831Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆502Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 2 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆953Updated this week
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆774Updated last week
- 🐸 collection of TTS papers☆660Updated 6 months ago
- Grapheme to phoneme conversion with deep learning.☆367Updated last year
- DeepSpeech based forced alignment tool☆235Updated 4 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆520Updated last year
- Phonetisaurus G2P☆457Updated 7 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆580Updated 3 years ago
- A small fast portable speech synthesis system☆914Updated 6 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆475Updated 3 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆151Updated 7 months ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆669Updated 2 months ago
- Text To Speech Synthesis with Vosk☆147Updated last month