SIP-Lab/CNN-VAD

A Convolutional Neural Network based Voice Activity Detector for Smartphones

☆70

Alternatives and similar repositories for CNN-VAD

Users who are interested in CNN-VAD are comparing it to the repositories listed below.

nicklashansen / voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
☆204 · Oct 14, 2019 · Updated 6 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆52 · Apr 7, 2019 · Updated 7 years ago
robin1001 / nn-vad
simple dnn based vad
☆69 · Dec 2, 2018 · Updated 7 years ago
netankit / AudioMLProject1
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18 · May 3, 2015 · Updated 11 years ago
nycsv / Voice_Activity_Detector
A statistical model-based Voice Activity Detection
☆196 · Nov 30, 2018 · Updated 7 years ago
Cocoxili / VAD
Voice Activity Detection
☆29 · Nov 13, 2017 · Updated 8 years ago
JarbasAl / kaldi_spotter
wake word spotting with kaldi
☆19 · Dec 3, 2020 · Updated 5 years ago
irebai / SpecAugment_KALDI
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15 · Sep 4, 2019 · Updated 6 years ago
jymsuper / VAD_tutorial
Simple DNN based Voice Activity Detection (VAD) using Pytorch
☆43 · Feb 8, 2020 · Updated 6 years ago
Mo-yun / DSDPRNN
Implementation of Dual-Stream DPRNN (paper: Nonlinear Residual Echo Suppression Based on Dual-Stream DPRNN)
☆21 · Jul 15, 2021 · Updated 5 years ago
idnavid / py_vad_tool
python script for voice activity detection.
☆36 · Aug 16, 2024 · Updated last year
i3thuan5 / hts_engine_python
python wrap for hts engine
☆14 · Jan 30, 2018 · Updated 8 years ago
hongwen-sun / speech-aligner
speech-aligner is a tool that produces phoneme-level time-alignment annotations from human speech and its corresponding text.
☆15 · Dec 19, 2018 · Updated 7 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
jtkim-kaist / VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869 · Jun 9, 2021 · Updated 5 years ago
mindorii / kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
☆387 · Mar 24, 2023 · Updated 3 years ago
kleinzcy / speech_signal_processing
☆15 · Jul 15, 2019 · Updated 7 years ago
pgys / NoIze
A selective noise filter architecture driven by a CNN and Wiener filter
☆17 · Nov 21, 2019 · Updated 6 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
bond005 / vad
Various algorithms for voice activity detection
☆22 · Jan 31, 2017 · Updated 9 years ago
sid0710 / audio_data_augmentation
☆26 · Sep 14, 2017 · Updated 8 years ago
wangkenpu / Adaptation-Interspeech18
Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model
☆13 · Nov 25, 2019 · Updated 6 years ago
Shb742 / rnnoise_python
python wrapper for rnnoise library
☆48 · Jan 5, 2023 · Updated 3 years ago
rajathkmp / speaker-verification
Implementation of state of the art d-vector approach for speaker verification
☆127 · Oct 1, 2017 · Updated 8 years ago
ZitengWang / python_kaldi_features
python codes to extract MFCC and FBANK speech features for Kaldi
☆67 · Nov 28, 2018 · Updated 7 years ago
NuanceDev / pyspeex
Python Speex
☆23 · Aug 10, 2017 · Updated 8 years ago
hcmlab / vadnet
Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
☆464 · Jun 3, 2020 · Updated 6 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99 · Apr 20, 2020 · Updated 6 years ago
filippogiruzzi / voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
☆372 · Updated this week
marsbroshok / VAD-python
Voice Activity Detector in Python
☆481 · Nov 17, 2020 · Updated 5 years ago
breizhn / DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open so…
☆15 · May 15, 2020 · Updated 6 years ago
ANLGBOY / MADE-with-PyTorch
MADE:Masked-Autoencoder-for-Distribution-Estimation-using-PyTorch
☆14 · Aug 20, 2020 · Updated 5 years ago
dr-pato / SSGD
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15 · Dec 22, 2022 · Updated 3 years ago
xiangxyq / 3gpp_vad
A port of the VAD from 3GPP specification 26073
☆14 · Feb 14, 2019 · Updated 7 years ago
luan78zaoha / kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
☆16 · Jul 4, 2019 · Updated 7 years ago
shincling / discreteSeparation
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12 · Oct 25, 2021 · Updated 4 years ago
tqbl / dcase2018_task2
Surrey CVSSP DCASE 2018 Task 2 system
☆20 · Dec 26, 2022 · Updated 3 years ago
yinruiqing / fsmn
Feedforward Sequential Memory Networks
☆18 · Aug 2, 2022 · Updated 3 years ago
NickWilkinson37 / voxseg
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
☆88 · Sep 7, 2022 · Updated 3 years ago