bond005/vad

Various algorithms for voice activity detection

☆22

Alternatives and similar repositories for vad

Users that are interested in vad are comparing it to the libraries listed below.

yakouyang / VAD
Voice activity detection (Python version; simple and easy to use)
☆12 · May 1, 2017 · Updated 9 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
YorLife / webRTC-
Uses WebRTC to process speech, implementing VAD and noise reduction
☆49 · Nov 13, 2018 · Updated 7 years ago
MorenoLaQuatra / vad
Simple voice activity detection (VAD) algorithm in Python
☆15 · Aug 10, 2023 · Updated 2 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and quickly.
☆16 · Jun 17, 2022 · Updated 4 years ago
chenwj1989 / python-speech-enhancement
A Python library for speech enhancement
☆82 · Jun 26, 2024 · Updated 2 years ago
alexdoberman / ma
Speech enhancement algorithms for microphone arrays
☆15 · May 12, 2020 · Updated 6 years ago
Cocoxili / VAD
Voice Activity Detection
☆29 · Nov 13, 2017 · Updated 8 years ago
sooftware / speech-recognition-papers
Awesome Automatic Speech Recognition (ASR) paper collection
☆22 · Sep 4, 2020 · Updated 5 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
0three / Speech-Denoise-With-Feature-Loss
This project fine-tunes the Speech Denoising with Deep Feature Losses network on a dataset of Chinese speech, yielding better denoising results on Chinese audio
☆30 · Nov 19, 2019 · Updated 6 years ago
ghunkins / Binaural-Source-Localization-CNN
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10 · Dec 16, 2017 · Updated 8 years ago
rosinality / imputer-pytorch
Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch
☆58 · May 3, 2020 · Updated 6 years ago
raphaelvdumas / noise-reduction
Noise reduction for audio signals
☆13 · Dec 27, 2021 · Updated 4 years ago
YiwenShaoStephen / pychain_example
☆48 · Jan 8, 2021 · Updated 5 years ago
sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14 · Jun 15, 2019 · Updated 7 years ago
athena-team / athena-decoder
☆76 · Mar 18, 2022 · Updated 4 years ago
idiap / pkwrap
A PyTorch wrapper for LF-MMI training and parallel training in Kaldi
☆73 · Jun 8, 2022 · Updated 4 years ago
thu-spmi / ST-NAS
Efficient Neural Architecture Search via Straight-Through Gradients
☆13 · Nov 12, 2020 · Updated 5 years ago
GWLee0524 / AMTL
Asymmetric Multi-Task Learning code. If you want to use it, please let me know and cite the AMTL paper.
☆11 · Aug 3, 2016 · Updated 9 years ago
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆31 · May 30, 2019 · Updated 7 years ago
nycsv / Speech_Enhancement_MMSE-STSA
Statistical model-based speech enhancement using MMSE-STSA
☆81 · May 9, 2018 · Updated 8 years ago
mwv / vad
Voice Activity Detector
☆74 · Mar 7, 2026 · Updated 4 months ago
kate-egorova / ASR-hybrid-decoding
This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [Kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
arief25ramadhan / sound-source-localization
Four neural network architectures to classify sound source direction
☆11 · Oct 3, 2020 · Updated 5 years ago
F-Tag / python-vad
py-webrtcvad wrapper for trimming speech clips
☆48 · Jul 3, 2022 · Updated 4 years ago
F-Olivieri / delay-and-sum-tutorial
A tutorial on the delay and sum beamformer for microphone arrays
☆18 · Jun 9, 2017 · Updated 9 years ago
matousc89 / signalz
Data generators in Python
☆14 · Jun 10, 2019 · Updated 7 years ago
Fhrozen / jrm_ssl
Files for the paper: "Sound Source Localization using Deep Residual Learning"
☆24 · Nov 13, 2017 · Updated 8 years ago
upskyy / Automatic-Speech-Recognition-Models
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10 · Jan 21, 2022 · Updated 4 years ago
netankit / AudioMLProject1
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in everyday scenarios. We train a cla…
☆18 · May 3, 2015 · Updated 11 years ago
ynop / py-ctc-decode
CTC decoder implementation in pure Python. Also supports language model decoding using KenLM.
☆37 · May 3, 2024 · Updated 2 years ago
hwanyyy / preprocessing-of-speech
VAD + resampling | High resolution spectrogram
☆14 · Nov 29, 2022 · Updated 3 years ago
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆146 · Jul 6, 2023 · Updated 3 years ago
zcf28 / StyleGAN-VC
Voice Conversion method based on speaker style
☆14 · Aug 7, 2021 · Updated 4 years ago
xushengyuan / FastSing2
An improved version of the Fastsinging singing voice synthesis system.
☆21 · Nov 3, 2020 · Updated 5 years ago
amontalban / A2Billing-Install-Script
A2Billing automated install script for CentOS 5
☆15 · Nov 7, 2013 · Updated 12 years ago
trexwithoutt / Speech-Emotion-Recognition-utterancelevel-DNN
Work inspired by the Microsoft Research project on SER using ELM
☆19 · Jul 4, 2018 · Updated 8 years ago