alphacep / vosk
VOSK Speech Recognition Toolkit
☆425Updated 2 years ago
Alternatives and similar repositories for vosk
Users who are interested in vosk are comparing it to the libraries listed below.
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,048Updated last week
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆342Updated last year
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆836Updated last year
- An official git mirror of Kaldi project SVN repo☆53Updated 9 months ago
- Festival Speech Synthesis System☆423Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆471Updated 5 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,327Updated 11 months ago
- Model for recasing and repunctuating ASR transcripts☆133Updated last year
- 🐸STT integration examples☆128Updated 2 years ago
- Offline speech recognition for Android with Vosk library.☆856Updated last year
- Efficient neural speech synthesis☆1,169Updated 8 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Phonetisaurus G2P☆477Updated last year
- A Python wrapper for Kaldi☆1,017Updated 4 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆315Updated 6 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆536Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks☆444Updated 4 years ago
- On-device streaming speech-to-text engine powered by deep learning☆627Updated 3 weeks ago
- ESPnet Model Zoo☆251Updated last year
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆583Updated 3 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 8 months ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.☆1,082Updated 11 months ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆354Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆172Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supported languages that can use characters or subw…☆975Updated last week
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆861Updated last year
- On-device speech-to-text engine powered by deep learning☆456Updated 3 weeks ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆539Updated last year