Picovoice/voice-activity-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Picovoice/voice-activity-benchmark)

Picovoice / voice-activity-benchmark

Voice activity engine benchmark framework

☆23

Alternatives and similar repositories for voice-activity-benchmark

Users that are interested in voice-activity-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Picovoice / speech-to-intent-benchmark
View on GitHub
benchmark for Speech-to-Intent engines
☆18Updated this week
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
Picovoice / browser-extension
View on GitHub
Picovoice Browser Extension
☆17Jun 24, 2026Updated last month
wangfu91 / ten-vad-rs
View on GitHub
A Rust library for working with the TEN VAD (Voice Activity Detection) ONNX model.
☆18Apr 3, 2026Updated 3 months ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
Picovoice / octopus
View on GitHub
On-device Speech-to-Index engine powered by deep learning
☆36Apr 16, 2025Updated last year
FarFetchd / clickitongue
View on GitHub
Mic-controlled mouse clicks
☆17Oct 6, 2025Updated 9 months ago
daanzu / py-silero-vad-lite
View on GitHub
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
☆17Nov 25, 2024Updated last year
BUTSpeechFIT / DeCRED
View on GitHub
☆18Aug 13, 2025Updated 11 months ago
wavekat / wavekat-vad
View on GitHub
Voice Activity Detection library for Rust with a unified trait interface over multiple backends (WebRTC VAD, Silero). Includes vad-lab, a…
☆25Jun 4, 2026Updated last month
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
yuhangear / kaldi-android
View on GitHub
☆15Nov 5, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
soonsoon2 / copilot-dev-day-ewha-king
View on GitHub
GitHub Copilot Dev Day × 이화여자대학교 KING 게임 동아리 | 2025년 4월 13일 (월) 19:00~21:00 | Microsoft 서울 광화문 사옥
☆26Apr 6, 2026Updated 3 months ago
Picovoice / eagle
View on GitHub
On-device speaker recognition engine powered by deep learning
☆54Updated this week
Picovoice / falcon
View on GitHub
On-device speaker diarization powered by deep learning
☆75Updated this week
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 5 months ago
google / mimosa
View on GitHub
Multiple input multiple output switch (MIMOSA) hardware.
☆25Sep 20, 2021Updated 4 years ago
Picovoice / speaker-diarization-benchmark
View on GitHub
Speaker diarization benchmark framework
☆42Jul 17, 2026Updated last week
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
Picovoice / koala
View on GitHub
On-device noise suppression powered by deep learning
☆92Updated this week
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
Picovoice / orca
View on GitHub
On-device streaming text-to-speech engine powered by deep learning
☆141Updated this week
Picovoice / cobra
View on GitHub
On-device voice activity detection (VAD) powered by deep learning
☆266Updated this week
manmay-nakhashi / TTSizer
View on GitHub
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆18May 20, 2025Updated last year
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 11 months ago
latishab / turnsense
View on GitHub
A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.
☆60Mar 20, 2026Updated 4 months ago
homink / deepspeech.pytorch.ko
View on GitHub
☆22Jul 3, 2019Updated 7 years ago
laboratory50 / russian-spellpack
View on GitHub
Пакет словарей русского языка с поддержкой букв Е и Ё
☆15Oct 4, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
gooofy / kaldi-adapt-lm
View on GitHub
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Jan 26, 2020Updated 6 years ago
jwr1995 / PubSep
View on GitHub
Repository of published DNN speech separation recipes for a number of datasets
☆13Jan 22, 2024Updated 2 years ago
openassistive / awesome-assistivetech
View on GitHub
A curated list of 😎 awesome assistive-technology frameworks to help you develop your AT tool/system
☆30Jul 6, 2020Updated 6 years ago
daanzu / py-webrtcvad-wheels
View on GitHub
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
☆41Jan 12, 2026Updated 6 months ago
b-sigpro / neural-fcasa
View on GitHub
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
☆40Mar 12, 2025Updated last year
jwr1995 / Disable-discrete-GPU-macOS
View on GitHub
A simple shell script to disable discrete GPUs for MacBook Pros affected by GPU issues
☆21Jun 8, 2018Updated 8 years ago
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated last year