dtreskunov/tiny-kaldi

Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.

☆16

Alternatives and similar repositories for tiny-kaldi

Users that are interested in tiny-kaldi are comparing it to the libraries listed below.

daanzu / kaldi-fork-active-grammar
☆10 · Updated this week
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17 · Nov 28, 2021 · Updated 4 years ago
daanzu / wenet_stt_python
☆33 · Nov 27, 2021 · Updated 4 years ago
shaypal5 / s3bp
Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
☆24 · Feb 26, 2018 · Updated 8 years ago
georgepar / kaldi-grpc-server
Deploy Kaldi models using grpc for bidirectional streaming.
☆17 · Sep 30, 2024 · Updated last year
chrisspen / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆34 · Aug 10, 2020 · Updated 5 years ago
lallubharteja / KWS-Scripts
Keyword Search Recipe for Subword ASR
☆30 · Jul 12, 2019 · Updated 7 years ago
voberoi / voice-search-with-whisper-duckdb-and-metaphone
This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associated blog post is here: https:…
☆13 · May 15, 2024 · Updated 2 years ago
MLSpeech / speech_yolo
SpeechYOLO Interspeech 2019
☆45 · Aug 16, 2022 · Updated 3 years ago
Speech-Lab-IITM / English_ASR_Challenge
English ASR Challenge organized by Speech Lab, IIT Madras
☆10 · Feb 3, 2021 · Updated 5 years ago
BiometricVox / DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12 · Nov 8, 2018 · Updated 7 years ago
yeyupiaoling / yeyupiaoling
☆15 · Updated this week
LeonWlw / asr_blockformer
E2E ASR system
☆14 · Oct 20, 2022 · Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 · Mar 30, 2021 · Updated 5 years ago
alikaratana / SpeakerRecognition
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10 · Dec 31, 2017 · Updated 8 years ago
sskorol / vosk-api-gpu
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC
☆45 · May 16, 2022 · Updated 4 years ago
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆12 · Oct 5, 2022 · Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 · Nov 5, 2021 · Updated 4 years ago
MiuLab / SpokenCSE
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11 · May 19, 2023 · Updated 3 years ago
awasthiabhijeet / Error-Driven-ASR-Personalization
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11 · Jun 13, 2021 · Updated 5 years ago
JRMeyer / speakerID-challenge
A recipe for creating a Speaker Identification system built on Kaldi.
☆15 · Jan 2, 2020 · Updated 6 years ago
e13000 / directional_sparse_filtering
Directional sparse filtering for blind speech separation
☆11 · Jun 8, 2021 · Updated 5 years ago
adamcsvarga / speaker-clustering
Unsupervised Speaker Clustering & Speaker Recognition
☆13 · Jan 7, 2019 · Updated 7 years ago
crcresearch / GOS
Global Open Simulator
☆10 · May 5, 2025 · Updated last year
antonraharja / book-opensips-101
My online writings. This time it's about OpenSIPS 101
☆28 · Mar 20, 2015 · Updated 11 years ago
olimiemma / Gemini-2.5-Pro-for-Audio-Transcription
☆16 · Apr 7, 2025 · Updated last year
lschilli / wav-aec
Applying webrtc's acoustic echo cancellation (AEC) to audio files
☆37 · Apr 21, 2016 · Updated 10 years ago
georgid / Lyrics2AudioAligner
Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping
☆14 · Mar 14, 2018 · Updated 8 years ago
marytts / pavoque-data
PAVOQUE Corpus of Expressive Speech
☆12 · Aug 2, 2016 · Updated 9 years ago
dt-rnd / wav_classifier
Classify audio samples using a neural network
☆10 · May 19, 2017 · Updated 9 years ago
seven1240 / FreeSWITCH-Portal
☆26 · Jun 5, 2013 · Updated 13 years ago
mpuels / docker-py-kaldi-asr-and-model
STT Service based on Kaldi ASR
☆15 · Aug 17, 2018 · Updated 7 years ago
Bobrovskih / pcap2wav
Extract wav from pcap (rtp)
☆14 · Jul 17, 2018 · Updated 8 years ago
yuhangear / wenet-android
☆13 · Oct 27, 2021 · Updated 4 years ago
h7shin / audiosearchengine
Python Audio Search Engine: search for audio .wav files based on percent similarity
☆14 · May 12, 2014 · Updated 12 years ago
stevecox1964 / PythonVAD
Python Voice Activity Detection for Chat Bots
☆14 · Mar 31, 2019 · Updated 7 years ago
talhanai / kaldi-diar-latte
Steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
apertium / apertium-cat
Apertium linguistic data for Catalan
☆11 · Mar 13, 2026 · Updated 4 months ago