matteo-convertino / vosk-build-model
How to create your own model for vosk
☆69Updated 3 years ago
Alternatives and similar repositories for vosk-build-model:
Users that are interested in vosk-build-model are comparing it to the libraries listed below
- Model for recasing and repunctuating ASR transcripts☆133Updated 10 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 2 years ago
- Silence detection in audio stream using webrtcvad☆46Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Linguistic processing for Common Voice☆53Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆202Updated 6 months ago
- ☆38Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆197Updated this week
- Grapheme To Phoneme☆70Updated 6 months ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- Kaldi based speaker verification☆47Updated 7 years ago
- Adapting your own Language Model for Kaldi☆64Updated 6 years ago
- Server framework for Kaldi ASR Toolkit☆98Updated last year
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆189Updated 4 years ago
- Make a Wake word detection engine like "Ok, google!"☆61Updated 2 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆203Updated 3 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆162Updated 2 weeks ago
- 🐸STT integration examples☆124Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆172Updated 2 months ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆157Updated 7 months ago
- Various speech datasets made available to the public☆113Updated 2 months ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago