wiseman/py-webrtcvad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wiseman/py-webrtcvad)

wiseman / py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

☆2,492

Alternatives and similar repositories for py-webrtcvad

Users that are interested in py-webrtcvad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
marsbroshok / VAD-python
View on GitHub
Voice Activity Detector in Python
☆481Nov 17, 2020Updated 5 years ago
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,645Updated this week
hcmlab / vadnet
View on GitHub
Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks
☆464Jun 3, 2020Updated 6 years ago
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,314Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,898Updated this week
wangshub / python-vad
View on GitHub
🔈 Use python to achieve voice activity detection, this little program may be helpful for voice application
☆169Dec 28, 2017Updated 8 years ago
speechbrain / speechbrain
View on GitHub
A PyTorch-based Speech Toolkit
☆11,703Jun 15, 2026Updated last month
HarryVolek / PyTorch_Speaker_Verification
View on GitHub
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
☆598Jan 20, 2022Updated 4 years ago
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,434Sep 22, 2025Updated 10 months ago
pykaldi / pykaldi
View on GitHub
A Python wrapper for Kaldi
☆1,038Nov 30, 2025Updated 7 months ago
microsoft / DNS-Challenge
View on GitHub
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
☆1,448Jul 25, 2024Updated last year
amsehili / auditok
View on GitHub
An voice activity detection and audio segmentation tool
☆854Updated this week
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,885Jul 7, 2026Updated 2 weeks ago
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,399Mar 14, 2022Updated 4 years ago
google / uis-rnn
View on GitHub
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588Sep 25, 2024Updated last year
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348Jul 11, 2026Updated last week
xiongyihui / python-webrtc-audio-processing
View on GitHub
Python bindings of WebRTC Audio Processing
☆215May 7, 2025Updated last year
wenet-e2e / wenet
View on GitHub
Production First and Production Ready End-to-End Speech Recognition Toolkit
☆5,176Jun 15, 2026Updated last month
jameslyons / python_speech_features
View on GitHub
This library provides common speech features for ASR including MFCCs and filterbank energies.
☆2,423Oct 20, 2021Updated 4 years ago
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,558Mar 12, 2026Updated 4 months ago
filippogiruzzi / voice_activity_detection
View on GitHub
Voice Activity Detection based on Deep Learning & TensorFlow
☆372Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nicklashansen / voice-activity-detection
View on GitHub
Voice Activity Detection (VAD) using deep learning.
☆204Oct 14, 2019Updated 6 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,851Jul 11, 2026Updated last week
fgnt / nara_wpe
View on GitHub
Different implementations of "Weighted Prediction Error" for speech dereverberation
☆568Mar 19, 2025Updated last year
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆927Jan 5, 2023Updated 3 years ago
bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,558Sep 26, 2024Updated last year
dpirch / libfvad
View on GitHub
Voice activity detection (VAD) library, based on WebRTC's VAD engine
☆606Apr 2, 2024Updated 2 years ago
LCAV / pyroomacoustics
View on GitHub
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…
☆1,910Updated this week
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,170Apr 22, 2026Updated 3 months ago
Baidu-AIP / speech-vad-demo
View on GitHub
集成Webrtc的VAD，用于切分音频文件
☆343Aug 26, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
facebookresearch / denoiser
View on GitHub
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…
☆1,904Mar 14, 2023Updated 3 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,577May 13, 2026Updated 2 months ago