Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.
☆16Nov 14, 2020Updated 5 years ago
Alternatives and similar repositories for tiny-kaldi
Users that are interested in tiny-kaldi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 1, 2025Updated 6 months ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆507Dec 7, 2025Updated 4 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆12Oct 1, 2019Updated 6 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- WebAssembly port of FFmpeg☆18Mar 10, 2026Updated last month
- SpeechYOLO Interspeech 2019☆47Aug 16, 2022Updated 3 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- COVID-19 FAQ chatbot in python along with user interfce☆10Feb 2, 2024Updated 2 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Oct 6, 2023Updated 2 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Mar 25, 2023Updated 3 years ago
- ☆15Updated this week
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Notebooks etc. Analysis of SNOMED-CT for the Clinical Coding Pilot and related work☆14Jan 10, 2021Updated 5 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Jan 1, 2023Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Oct 1, 2017Updated 8 years ago
- Global Open Simulator☆10May 5, 2025Updated 11 months ago
- Directional sparse filtering for blind speech separation☆10Jun 8, 2021Updated 4 years ago
- Android application to track the running status☆11May 4, 2023Updated 2 years ago
- Applying webrtc's acoustic echo cancellation (AEC) to audio files☆37Apr 21, 2016Updated 10 years ago
- Classify audio samples using a neural network☆10May 19, 2017Updated 8 years ago
- STT Service based on Kaldi ASR☆15Aug 17, 2018Updated 7 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Apertium linguistic data for Catalan☆11Mar 13, 2026Updated last month
- ☆13Oct 27, 2021Updated 4 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- Python Voice Activity Detection for Chat Bots☆14Mar 31, 2019Updated 7 years ago
- It's an app for transforming audio to text. It runs locally and helps user add captions more easily.☆24May 7, 2020Updated 5 years ago