Voice activity engine benchmark framework
☆21Jan 14, 2026Updated last month
Alternatives and similar repositories for voice-activity-benchmark
Users that are interested in voice-activity-benchmark are comparing it to the libraries listed below
Sorting:
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 5 months ago
- ☆13Oct 27, 2021Updated 4 years ago
- Control your computer by voice!☆13Dec 8, 2022Updated 3 years ago
- Script to generate VAD dataset used in Asteroid recipe☆20Sep 30, 2021Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Multiple input multiple output switch (MIMOSA) hardware.☆24Sep 20, 2021Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆28Sep 18, 2022Updated 3 years ago
- Python client for Contec CMS50EW pulse oximeter☆11Apr 6, 2017Updated 8 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆45Nov 8, 2025Updated 3 months ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- Nr. 1 ranked "Pitch Detector" on the web. Implemented with WebAssembly.☆11Mar 24, 2021Updated 4 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- This tool can convert picture format(NV12/YUYV/UYVY...) to (png/jpg/bmp)☆10Jul 14, 2018Updated 7 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- Run spleeter as a pulseaudio plugin in realtime☆12Mar 24, 2023Updated 2 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Masked Face Image Augmentation Tool for Dataset 300W-LP with 6D Head Pose Information.☆12Aug 12, 2022Updated 3 years ago
- Android sound localization and classification app.☆14Jul 4, 2025Updated 8 months ago
- ☆17Apr 2, 2025Updated 11 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- ⚡️Official Image-charts Python library☆12Updated this week
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- 这是一个智能座舱中的驾驶员分心行为监测系统(DMS)☆18Aug 16, 2023Updated 2 years ago
- オーディオスペクトラムや波形をOpenCVで描画するサンプル☆14Aug 16, 2025Updated 6 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Apr 20, 2025Updated 10 months ago
- ☆26Oct 16, 2025Updated 4 months ago
- 🔊 extract runescape classic sounds from cache to wav (and vice versa)☆13Aug 2, 2022Updated 3 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago