idnavid/py_vad_tool

Python script for voice activity detection.

☆36

Alternatives and similar repositories for py_vad_tool

Users who are interested in py_vad_tool are comparing it to the libraries listed below.

yakouyang / VAD
voice active detection (python ver/simple and easy-to-use)
☆12 · May 1, 2017 · Updated 9 years ago
netankit / AudioMLProject1
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18 · May 3, 2015 · Updated 11 years ago
jymsuper / VAD_tutorial
Simple DNN based Voice Activity Detection (VAD) using Pytorch
☆43 · Feb 8, 2020 · Updated 6 years ago
marsbroshok / VAD-python
Voice Activity Detector in Python
☆481 · Nov 17, 2020 · Updated 5 years ago
nttcslab-sp / torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
☆20 · Feb 20, 2019 · Updated 7 years ago
irebai / SpecAugment_KALDI
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15 · Sep 4, 2019 · Updated 6 years ago
kenders2000 / MicWindNoiseGenerator
A program to generate microphone wind noise audio. Ideal for generating example data for designing noise removal algorithms.
☆19 · Jun 4, 2018 · Updated 8 years ago
undali / speex_resampler
Speex audio re-sampler sample project.
☆15 · Jan 19, 2018 · Updated 8 years ago
lzuwei / ip-avsr
Audio Visual Speech Recognition
☆23 · Aug 9, 2017 · Updated 8 years ago
dan-wells / kiss-aligner
Simple Kaldi recipe for forced alignment
☆11 · Jul 16, 2023 · Updated 3 years ago
lxj-sjtu / TCHES2021_Pay_attention_to_the_raw_traces
☆12 · Jun 22, 2021 · Updated 5 years ago
idnavid / speech_activity_detection
Unsupervised speech activity detection system.
☆11 · Jul 2, 2018 · Updated 8 years ago
SIP-Lab / CNN-VAD
A Convolutional Neural Network based Voice Activity Detector for Smartphones
☆70 · Apr 30, 2019 · Updated 7 years ago
steve3nto / NoiseReductionProject
Noise reduction for speech enhancement using matlab
☆23 · Jun 14, 2015 · Updated 11 years ago
hcmlab / vadnet
Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
☆464 · Jun 3, 2020 · Updated 6 years ago
kan-bayashi / WaveNetVocoderSamples
WaveNet Vocoder Samples
☆23 · Aug 23, 2019 · Updated 6 years ago
shriphani / Listener
Detect calls of attention in the surroundings
☆51 · Jun 10, 2013 · Updated 13 years ago
BiometricVox / DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12 · Nov 8, 2018 · Updated 7 years ago
CSLT-THU / IS2019-VAE
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11 · Mar 24, 2023 · Updated 3 years ago
BUTSpeechFIT / torch_msbg_mbstoi
Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.
☆21 · Nov 19, 2021 · Updated 4 years ago
audiolabs / MonteCarloRIRSimulation
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18 · Feb 25, 2026 · Updated 4 months ago
placebokkk / pyfst
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17 · Apr 2, 2018 · Updated 8 years ago
sigsep / sigsep-mus-io
Tools to convert sigsep mus dataset from STEMS <-> WAV
☆12 · Jul 15, 2020 · Updated 6 years ago
oxinabox / Kaldi-Notes
Some notes on Kaldi
☆32 · Feb 20, 2015 · Updated 11 years ago
smkalami / ypea108-cma-es
CMA-ES in MATLAB
☆15 · Dec 5, 2020 · Updated 5 years ago
mohit-nith / GeneralizedWOLA-SystemIdentification
Subband system identification using generalized Weighted Overlap-Add (WOLA) filter bank for improved acoustic echo cancellation.
☆15 · May 8, 2025 · Updated last year
lzuwei / end-to-end-multiview-lipreading
End to End Multiview Lip Reading
☆10 · Jan 26, 2018 · Updated 8 years ago
shunsukeaihara / pysas
Speech Analysis and Synthesis Toolkit for Python (2.X, 3.X).
☆16 · Aug 27, 2019 · Updated 6 years ago
BornInWater / Overlap-Detection
Overlapped Speech detection in Multi-party Conversations
☆22 · Feb 20, 2018 · Updated 8 years ago
ododoyo / TASNET
Time-domain Audio Separation Network
☆24 · Aug 3, 2018 · Updated 7 years ago
Aworselife / DPTBF
☆17 · Sep 12, 2023 · Updated 2 years ago
iariav / End-to-End-VAD
An Audio-Visual Voice Activity Detection using Deep Learning
☆52 · Apr 7, 2019 · Updated 7 years ago
danpovey / pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92 · Aug 6, 2022 · Updated 3 years ago
idiap / CNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32 · Sep 3, 2018 · Updated 7 years ago
nicklashansen / voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
☆204 · Oct 14, 2019 · Updated 6 years ago
nycsv / Voice_Activity_Detector
A statistical model-based Voice Activity Detection
☆196 · Nov 30, 2018 · Updated 7 years ago
qqueing / speaker_embedding-pytorch
"An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation
☆19 · Oct 8, 2018 · Updated 7 years ago
Sytronik / denoising-wavenet-pytorch
☆24 · Jul 22, 2019 · Updated 7 years ago
ctralie / GeometricCoverSongs
Geometry features for block window cover song identification (a continuation of my ISMIR 2015 paper)
☆24 · Jul 6, 2023 · Updated 3 years ago