matteo-convertino/vosk-build-model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/matteo-convertino/vosk-build-model)

matteo-convertino / vosk-build-model

How to create your own model for vosk

☆75

Alternatives and similar repositories for vosk-build-model

Users that are interested in vosk-build-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
rhasspy / phonetisaurus-pypi
View on GitHub
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12Mar 11, 2021Updated 5 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
projecte-aina / oTranscribe-plus
View on GitHub
A free & open tool for transcribing audio interviews with offline ASR support
☆25Dec 21, 2023Updated 2 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
EmergenceAI / kotlin_speech_features
View on GitHub
This library provides common speech features for ASR including MFCCs and filterbank energies for Android and iOS.
☆29Apr 9, 2026Updated 3 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
OpenVoiceOS / ovos-stt-plugin-vosk
View on GitHub
vosk STT plugin for mycroft
☆15Jun 15, 2026Updated last month
asrp / python-espeak
View on GitHub
Python C extension for the eSpeak speech synthesizer
☆12Jan 23, 2021Updated 5 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
rhasspy / rhasspy-silence
View on GitHub
Silence detection in audio stream using webrtcvad
☆49Dec 9, 2023Updated 2 years ago
speechio / asr-noises
View on GitHub
A handy dataset of noises for ASR
☆22May 29, 2019Updated 7 years ago
PhilippeRo / gst-vosk
View on GitHub
Gstreamer plugin for VOSK voice recognition engine
☆14Oct 2, 2022Updated 3 years ago
wq2012 / CurriculumVitae
View on GitHub
Curriculum Vitae of Quan Wang
☆15Dec 13, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
alphacep / unimrcp-vosk-plugin
View on GitHub
Open source cross-platform implementation of MRCP protocol
☆20Mar 3, 2022Updated 4 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
alumae / kiirkirjutaja
View on GitHub
☆58Jul 3, 2026Updated 2 weeks ago
sadrasabouri / 802.11a
View on GitHub
Software-Hardware Implementation of IEEE 802.11a Wifi Standard
☆14Apr 17, 2023Updated 3 years ago
cadia-lvl / punctuation-prediction
View on GitHub
Support tools for punctuation and boundary detection for ASR output.
☆55Dec 8, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Open-Speech-EkStep / crowdsource-dataplatform
View on GitHub
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17Mar 6, 2023Updated 3 years ago
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
daanzu / kaldi-fork-active-grammar
View on GitHub
☆10Updated this week
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
alikaratana / SpeakerRecognition
View on GitHub
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10Dec 31, 2017Updated 8 years ago
Idlak / Living-Audio-Dataset
View on GitHub
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43Aug 3, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
alumae / online_speaker_change_detector
View on GitHub
Online streaming speaker change detection model in Pytorch
☆44Apr 14, 2023Updated 3 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
chrisspen / punctuator2
View on GitHub
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆34Aug 10, 2020Updated 5 years ago
falabrasil / ufpalign
View on GitHub
👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro
☆13Jul 18, 2025Updated last year
georgepar / kaldi-grpc-server
View on GitHub
Deploy Kaldi models using grpc for bidirectional streaming.
☆17Sep 30, 2024Updated last year
alphacep / whisper-prompts
View on GitHub
OpenAI Whisper Prompt Examples
☆53Jul 17, 2023Updated 3 years ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago