Voice activity engine benchmark framework
☆21Jan 14, 2026Updated 3 months ago
Alternatives and similar repositories for voice-activity-benchmark
Users that are interested in voice-activity-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 10, 2021Updated 4 years ago
- Control your computer by voice!☆13Dec 8, 2022Updated 3 years ago
- On-device Speech-to-Index engine powered by deep learning☆36Apr 16, 2025Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 6 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- High-level API for creating dragonfly grammars☆14Oct 11, 2021Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Script to generate VAD dataset used in Asteroid recipe☆21Sep 30, 2021Updated 4 years ago
- A curated list of 😎 awesome assistive-technology frameworks to help you develop your AT tool/system☆29Jul 6, 2020Updated 5 years ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆158Feb 23, 2026Updated last month
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 7 months ago
- Python client for Contec CMS50EW pulse oximeter☆11Apr 6, 2017Updated 9 years ago
- ☆22Jul 3, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository of published DNN speech separation recipes for a number of datasets☆12Jan 22, 2024Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Apr 3, 2026Updated 2 weeks ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 9 months ago
- A simple shell script to disable discrete GPUs for MacBook Pros affected by GPU issues☆21Jun 8, 2018Updated 7 years ago
- ☆18Jun 12, 2025Updated 10 months ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- The backpropagation algorithm explained and demonstrated.☆23Feb 14, 2020Updated 6 years ago
- Control the mouse using a keyboard or speech recognition on Linux☆12Jul 11, 2019Updated 6 years ago
- ☆10Nov 1, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A terminal-based UI application for managing Google Cloud instances, inspired by k9s for Kubernetes and e1s for ECS☆41Feb 3, 2026Updated 2 months ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- Android sound localization and classification app.☆14Jul 4, 2025Updated 9 months ago
- Repo for hosting tutorial code associated with the "AssemblyAI and Python in 5 Minutes" blog by AssemblyAI☆12Jul 29, 2023Updated 2 years ago
- ☆12Oct 17, 2024Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- 🔊 extract runescape classic sounds from cache to wav (and vice versa)☆13Aug 2, 2022Updated 3 years ago
- G.729А audio codec for python 3☆13Mar 18, 2020Updated 6 years ago
- Sound field reconstruction using neural processes with dynamic kernels☆16Mar 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Dec 2, 2019Updated 6 years ago
- ☆18Oct 26, 2023Updated 2 years ago
- Toolkit for training/adapting CMU Sphinx acoustic models☆17May 25, 2018Updated 7 years ago
- Masked Face Image Augmentation Tool for Dataset 300W-LP with 6D Head Pose Information.☆12Aug 12, 2022Updated 3 years ago