Voice activity engine benchmark framework
☆21 · Jan 14, 2026 · Updated 2 months ago
Alternatives and similar repositories for voice-activity-benchmark
Users that are interested in voice-activity-benchmark are comparing it to the libraries listed below.
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- Control your computer by voice! ☆13 · Dec 8, 2022 · Updated 3 years ago
- Mic-controlled mouse clicks ☆17 · Oct 6, 2025 · Updated 5 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies ☆15 · Nov 25, 2024 · Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- High-level API for creating dragonfly grammars ☆14 · Oct 11, 2021 · Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 · Jan 24, 2022 · Updated 4 years ago
- Script to generate VAD dataset used in Asteroid recipe ☆21 · Sep 30, 2021 · Updated 4 years ago
- Multiple input multiple output switch (MIMOSA) hardware. ☆24 · Sep 20, 2021 · Updated 4 years ago
- A curated list of 😎 awesome assistive-technology frameworks to help you develop your AT tool/system ☆28 · Jul 6, 2020 · Updated 5 years ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details. ☆17 · May 9, 2025 · Updated 10 months ago
- Python wrapper for kaldi's arpa2fst ☆38 · Aug 27, 2025 · Updated 7 months ago
- Python client for Contec CMS50EW pulse oximeter ☆11 · Apr 6, 2017 · Updated 8 years ago
- ☆22 · Jul 3, 2019 · Updated 6 years ago
- Repository of published DNN speech separation recipes for a number of datasets ☆12 · Jan 22, 2024 · Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA) ☆16 · Oct 14, 2025 · Updated 5 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 9 months ago
- A simple shell script to disable discrete GPUs for MacBook Pros affected by GPU issues ☆21 · Jun 8, 2018 · Updated 7 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆33 · Jan 26, 2020 · Updated 6 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques ☆28 · Sep 18, 2022 · Updated 3 years ago
- ☆18 · Jun 12, 2025 · Updated 9 months ago
- ⚡️ Official Image-charts Python library ☆12 · Updated this week
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast… ☆11 · Oct 29, 2018 · Updated 7 years ago
- ☆29 · Mar 6, 2026 · Updated 3 weeks ago
- Control the mouse using a keyboard or speech recognition on Linux ☆12 · Jul 11, 2019 · Updated 6 years ago
- ☆10 · Nov 1, 2025 · Updated 4 months ago
- Android sound localization and classification app. ☆14 · Jul 4, 2025 · Updated 8 months ago
- A terminal-based UI application for managing Google Cloud instances, inspired by k9s for Kubernetes and e1s for ECS ☆42 · Feb 3, 2026 · Updated last month
- PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021 ☆159 · Oct 26, 2021 · Updated 4 years ago
- Repo for hosting tutorial code associated with the "AssemblyAI and Python in 5 Minutes" blog by AssemblyAI ☆12 · Jul 29, 2023 · Updated 2 years ago
- ☆12 · Oct 17, 2024 · Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- G.729A audio codec for Python 3 ☆13 · Mar 18, 2020 · Updated 6 years ago
- Sound field reconstruction using neural processes with dynamic kernels ☆16 · Mar 25, 2025 · Updated last year
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 · Jul 22, 2021 · Updated 4 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6 ☆18 · Dec 2, 2019 · Updated 6 years ago
- ☆18 · Oct 26, 2023 · Updated 2 years ago