matteo-convertino / vosk-build-model
How to create your own model for vosk
β72Updated 3 years ago
Alternatives and similar repositories for vosk-build-model:
Users that are interested in vosk-build-model are comparing it to the libraries listed below
- πΈSTT integration examplesβ127Updated 2 years ago
- Model for recasing and repunctuating ASR transcriptsβ133Updated last year
- Linguistic processing for Common Voiceβ55Updated last year
- Open models for Coqui STTβ138Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ208Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ112Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 5 years ago
- Kaldi based speaker verificationβ47Updated 7 years ago
- Experiments to test different speech recognition systems for SEPIA Frameworkβ60Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ100Updated 2 months ago
- Various speech datasets made available to the publicβ116Updated 4 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β353Updated last year
- β39Updated last year
- A crash course for training speech recognition models using DeepSpeech.β25Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTCβ69Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β310Updated 5 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated last year
- Official home of the Idlak Speech Synthesis Toolkitβ66Updated 3 years ago
- Voice Activity Detection (VAD) using deep learning.β196Updated 5 years ago
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Updated 4 years ago
- Silence detection in audio stream using webrtcvadβ47Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpusβ171Updated 5 months ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.β106Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.β49Updated 4 years ago
- This repository is a collection of TTS Models in TFLiteβ192Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB databaseβ35Updated last year
- Grapheme To Phonemeβ73Updated 9 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 4 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ135Updated last year