Picovoice / voice-activity-benchmarkView external linksLinks
Voice activity engine benchmark framework
☆21Jan 14, 2026Updated last month
Alternatives and similar repositories for voice-activity-benchmark
Users that are interested in voice-activity-benchmark are comparing it to the libraries listed below
Sorting:
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 4 months ago
- Control your computer by voice!☆13Dec 8, 2022Updated 3 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Script to generate VAD dataset used in Asteroid recipe☆20Sep 30, 2021Updated 4 years ago
- High-level API for creating dragonfly grammars☆14Oct 11, 2021Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- A curated list of 😎 awesome assistive-technology frameworks to help you develop your AT tool/system☆28Jul 6, 2020Updated 5 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆28Sep 18, 2022Updated 3 years ago
- Python client for Contec CMS50EW pulse oximeter☆11Apr 6, 2017Updated 8 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆45Nov 8, 2025Updated 3 months ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆160Oct 26, 2021Updated 4 years ago
- Sublime Text 3 plugin for voice coding Python 3☆13Sep 15, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Run spleeter as a pulseaudio plugin in realtime☆12Mar 24, 2023Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- This tool can convert picture format(NV12/YUYV/UYVY...) to (png/jpg/bmp)☆10Jul 14, 2018Updated 7 years ago
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Nr. 1 ranked "Pitch Detector" on the web. Implemented with WebAssembly.☆11Mar 24, 2021Updated 4 years ago
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- ☆26Oct 16, 2025Updated 3 months ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆16May 9, 2025Updated 9 months ago
- Make windows installer 🪟 for flutter powered apps💻.☆13Jul 6, 2024Updated last year
- Masked Face Image Augmentation Tool for Dataset 300W-LP with 6D Head Pose Information.☆12Aug 12, 2022Updated 3 years ago
- ☆17Apr 2, 2025Updated 10 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- ☆11Nov 5, 2021Updated 4 years ago
- 这是一个智能座舱中的驾驶员分心行为监测系统(DMS)☆18Aug 16, 2023Updated 2 years ago
- ⚡️Official Image-charts Python library☆12Updated this week
- ☆12Apr 22, 2024Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Control the mouse using a keyboard or speech recognition on Linux☆12Jul 11, 2019Updated 6 years ago