iariav/End-to-End-VAD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iariav/End-to-End-VAD)

iariav / End-to-End-VAD

an Audio-Visual Voice Activity Detection using Deep Learning

☆52

Alternatives and similar repositories for End-to-End-VAD

Users that are interested in End-to-End-VAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cocoxili / VAD
View on GitHub
Voice Activity Detection
☆29Nov 13, 2017Updated 8 years ago
jymsuper / VAD_tutorial
View on GitHub
Simple DNN based Voice Activity Detection (VAD) using Pytorch
☆43Feb 8, 2020Updated 6 years ago
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
netankit / AudioMLProject1
View on GitHub
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18May 3, 2015Updated 11 years ago
SIP-Lab / CNN-VAD
View on GitHub
A Convolutional Neural Network based Voice Activity Detector for Smartphones
☆70Apr 30, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LeadingIndiaAI / Wake-UP-word-detection
View on GitHub
Wake-up-word(WUW)system is an emerging development in recent times. Voice interaction with systems have made life ease and aids in multi-…
☆18Mar 11, 2019Updated 7 years ago
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
usc-sail / mica-speech-activity-detection
View on GitHub
Robust Speech Activity Detection (SAD) in movie audio
☆26Jan 27, 2021Updated 5 years ago
maveryn / robust-vad
View on GitHub
Lightweight CNN for Robust Voice Activity Detection
☆20Jun 30, 2023Updated 3 years ago
RicherMans / Datadriven-GPVAD
View on GitHub
The codebase for Data-driven general-purpose voice activity detection.
☆93Aug 3, 2023Updated 2 years ago
isrish / VAD-LTSD
View on GitHub
Efficient voice activity detection algorithm using long-term speech information
☆46Jan 9, 2018Updated 8 years ago
linan2 / TensorFlow-speech-enhancement
View on GitHub
DNN and RCED speech enhancement
☆20Jan 30, 2024Updated 2 years ago
magronp / omisi
View on GitHub
Online spectrogram inversion for audio source separation
☆11Oct 11, 2025Updated 9 months ago
zhenghuatan / rVAD
View on GitHub
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆140Jan 20, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
olami-developers / olami-android-hotword-detect-sdk
View on GitHub
Hotword Detection (Wake Word Detection) Android library and sample codes
☆11Apr 9, 2018Updated 8 years ago
nicklashansen / voice-activity-detection
View on GitHub
Voice Activity Detection (VAD) using deep learning.
☆204Oct 14, 2019Updated 6 years ago
nycsv / Voice_Activity_Detector
View on GitHub
A statistical model-based Voice Activity Detection
☆196Nov 30, 2018Updated 7 years ago
hcmlab / vadnet
View on GitHub
Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks
☆465Jun 3, 2020Updated 6 years ago
xuchenglin28 / speech_separation
View on GitHub
Constrained Permutation Invariant Training, Speech Separation
☆52Jan 24, 2021Updated 5 years ago
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
donchev7 / MatlabCode
View on GitHub
☆14Aug 10, 2015Updated 10 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
marsbroshok / VAD-python
View on GitHub
Voice Activity Detector in Python
☆481Nov 17, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
york135 / CTC_CE_for_AST
View on GitHub
The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…
☆12Mar 25, 2025Updated last year
Yuanbo2020 / Audio-Visual-VAD
View on GitHub
☆13May 9, 2022Updated 4 years ago
mpc001 / end-to-end-lipreading
View on GitHub
Pytorch code for End-to-End Audiovisual Speech Recognition
☆183Nov 18, 2022Updated 3 years ago
okankop / ASDNet
View on GitHub
Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset
☆73Jan 18, 2022Updated 4 years ago
neillu23 / DiffuSE
View on GitHub
☆36Aug 21, 2021Updated 4 years ago
skgusrb12 / voice_activity_detection
View on GitHub
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆27Mar 20, 2021Updated 5 years ago
mounalab / LSTM-RNN-VAD
View on GitHub
Voice Activity Detection LSTM-RNN learning model
☆50Apr 17, 2018Updated 8 years ago
mengsaisi / VAD_campare
View on GitHub
几种VAD算法的测评
☆26Jul 31, 2020Updated 5 years ago
lzuwei / ip-avsr
View on GitHub
Audio Visual Speech Recognition
☆23Aug 9, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dpwe / pitchfilter
View on GitHub
Speech enhancement by time-varying pitch-dependent filtering of harmonics
☆27Jul 3, 2014Updated 12 years ago
filippogiruzzi / voice_activity_detection
View on GitHub
Voice Activity Detection based on Deep Learning & TensorFlow
☆373Jul 22, 2026Updated last week
MorrisXu-Driving / Speech-Augmentation-and-Endpoint-Detection
View on GitHub
This repository is developed in MATLAB. Speech Augmentation is based on Adaptive Filtering while Endpoint Detection is based on Voice Act…
☆10Dec 7, 2020Updated 5 years ago
dqhplhzz2008 / Study-notes
View on GitHub
总结了一些我的学习笔记，包括linux、C++、Java、Python、算法等，以及找工作时候的一些面经和笔记等。
☆16Jun 12, 2019Updated 7 years ago
wangkenpu / WSJ2WAV
View on GitHub
Convert WSJ sphere format to waveform and do data simulation.
☆16Feb 20, 2020Updated 6 years ago
idnavid / py_vad_tool
View on GitHub
python script for voice activity detection.
☆36Aug 16, 2024Updated last year
ifnspaml / Enhancement-Coded-Speech
View on GitHub
☆24Apr 25, 2022Updated 4 years ago