maveryn/robust-vad

Lightweight CNN for Robust Voice Activity Detection

☆20

Alternatives and similar repositories for robust-vad

Users who are interested in robust-vad are comparing it to the libraries listed below.

skgusrb12 / voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆27 · Mar 20, 2021 · Updated 5 years ago
jymsuper / VAD_tutorial
Simple DNN-based Voice Activity Detection (VAD) using PyTorch
☆43 · Feb 8, 2020 · Updated 6 years ago
voithru / voice-activity-detection
PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆159 · Oct 26, 2021 · Updated 4 years ago
Yifei-ZHAO96 / STAM-pytorch
PyTorch implementation of "spectro-temporal attention-based voice activity detection"
☆13 · Jun 4, 2024 · Updated 2 years ago
pprablanc / ppsrt
A Python algorithm to change the pitch of the voice in real time
☆13 · Dec 13, 2020 · Updated 5 years ago
nicklashansen / voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
☆204 · Oct 14, 2019 · Updated 6 years ago
Cocoxili / VAD
Voice Activity Detection
☆29 · Nov 13, 2017 · Updated 8 years ago
aminul-huq / Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been train…
☆10 · Dec 5, 2022 · Updated 3 years ago
yinruiqing / tiny-transducer
Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices
☆30 · Aug 4, 2022 · Updated 3 years ago
gaochangw / DeltaRNN
Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Data
☆18 · Sep 30, 2023 · Updated 2 years ago
iariav / End-to-End-VAD
Audio-visual voice activity detection using deep learning
☆52 · Apr 7, 2019 · Updated 7 years ago
NickWilkinson37 / voxseg
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
☆88 · Sep 7, 2022 · Updated 3 years ago
thgpddl / TensorFlowLiteEmotionDemo
Runs a tflite model for facial expression recognition on Android
☆12 · Apr 7, 2021 · Updated 5 years ago
anant-pathak / EyeContactCorrection_with_FrozenModel
☆15 · Sep 25, 2020 · Updated 5 years ago
sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14 · Jun 15, 2019 · Updated 7 years ago
cqu20160901 / DETR_onnx_tensorRT_V2
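DETR with TensorRT: removes the unused auxiliary heads from the inference path, adds fp16 deployment for a further speedup, and offers a new method for fixing the all-zero output problem after conversion to TensorRT.
☆12 · Jan 9, 2024 · Updated 2 years ago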
jamesrequa / GAN-Image-Classifier
Build a GAN for image classification using semi-supervised learning.
☆10 · Jul 1, 2017 · Updated 9 years ago
I-Man-H / DeepVADNet
☆13 · Jun 22, 2026 · Updated last month
moritzhambach / CPU-vs-GPU-benchmark-on-MNIST
Compares the training duration of a CNN on CPU (i7 8550U) vs. GPU (mx150) with CUDA, depending on batch size
☆12 · Mar 24, 2018 · Updated 8 years ago
xavierfav / feature-comparison-clustering
Comparing Audio Features for Unsupervised Sound Classification
☆10 · Jun 22, 2022 · Updated 4 years ago
fclearner / Personal-vad-2.0
Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
☆16 · Jun 9, 2026 · Updated last month
Yuanbo2020 / Audio-Visual-VAD
☆13 · May 9, 2022 · Updated 4 years ago
zuhairmhtb / AudioClassification
This software is a demonstration of Audio Signal Processing and Machine Learning using Python and Tensorflow. The software contains a GU…
☆12 · Dec 7, 2023 · Updated 2 years ago
gsmafra / lee-2009-audio
Unsupervised feature learning for audio classification using convolutional deep belief networks
☆11 · Jul 25, 2015 · Updated 11 years ago
hongfeixue / KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
☆56 · Mar 15, 2022 · Updated 4 years ago
raymondxyy / strfnet-IS2020
Official repo for the STRFNet system that appeared at INTERSPEECH 2020
☆12 · Mar 6, 2021 · Updated 5 years ago
RickyMexx / 3D-Sound-Localization
Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.
☆19 · Nov 21, 2022 · Updated 3 years ago
freekatz / pcap-tutorial
PCAP: from beginner to master
☆13 · Sep 26, 2024 · Updated last year
jagger2048 / WebRtc_AGC1
This repository is a demo of the WebRTC AGC module.
☆12 · Jan 23, 2019 · Updated 7 years ago
m-kazuki / AuxIVA
☆12 · May 30, 2019 · Updated 7 years ago
Okrio / deepvqe
☆14 · Oct 12, 2023 · Updated 2 years ago
AlexKly / Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex
Voice Activity Detector based on MFCC features and DNN model
☆30 · Jul 3, 2023 · Updated 3 years ago
hcmlab / vadnet
Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
☆464 · Jun 3, 2020 · Updated 6 years ago
CaA23187 / VAD-based-on-LSTM
An LSTM for voice activity detection. In fact, this is a homework assignment that I didn't expect.
☆13 · Dec 3, 2020 · Updated 5 years ago
adam2go / mfcc
Calculate MFCC/Fbank feature for wav files
☆15 · Nov 21, 2017 · Updated 8 years ago
moyemoji / CTPN
CTPN, a deep learning network architecture for text line detection: VGG16 + RNN + FC
☆17 · Mar 14, 2019 · Updated 7 years ago
RicherMans / GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆141 · Aug 3, 2023 · Updated 2 years ago
huangyz0918 / kws-continual-learning
[ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting
☆17 · Jun 7, 2022 · Updated 4 years ago
aws-samples / serverless-websocket-chat
This is a fully-serverless real-time chat example using API Gateway, Lambda and DynamoDB. Messages from one client are broadcast to all o…
☆17 · Jun 18, 2024 · Updated 2 years ago