xashru/robust-vad

Lightweight CNN for Robust Voice Activity Detection

☆20

Alternatives and similar repositories for robust-vad

Users who are interested in robust-vad are comparing it to the repositories listed below.

skgusrb12 / voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆27 · Mar 20, 2021 · Updated 4 years ago
Yifei-ZHAO96 / STAM-pytorch
PyTorch implementation of "spectro-temporal attention-based voice activity detection"
☆13 · Jun 4, 2024 · Updated last year
voithru / voice-activity-detection
PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆159 · Oct 26, 2021 · Updated 4 years ago
iariav / End-to-End-VAD
Audio-visual voice activity detection using deep learning
☆50 · Apr 7, 2019 · Updated 6 years ago
nicklashansen / voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
☆204 · Oct 14, 2019 · Updated 6 years ago
Cocoxili / VAD
Voice Activity Detection
☆29 · Nov 13, 2017 · Updated 8 years ago
pprablanc / ppsrt
A Python algorithm to change the pitch of the voice in real time
☆13 · Dec 13, 2020 · Updated 5 years ago
NickWilkinson37 / voxseg
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
☆88 · Sep 7, 2022 · Updated 3 years ago
thgpddl / TensorFlowLiteEmotionDemo
Runs a tflite facial expression recognition model on Android
☆12 · Apr 7, 2021 · Updated 4 years ago
moritzhambach / CPU-vs-GPU-benchmark-on-MNIST
Compare the training duration of a CNN on CPU (i7 8550U) vs GPU (mx150) with CUDA, depending on batch size
☆12 · Mar 24, 2018 · Updated 7 years ago
alterxyz / YTelegraph
Python Telegraph API.
☆15 · Mar 22, 2025 · Updated 11 months ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆93 · Aug 3, 2023 · Updated 2 years ago
xavierfav / feature-comparison-clustering
Comparing Audio Features for Unsupervised Sound Classification
☆10 · Jun 22, 2022 · Updated 3 years ago
cqu20160901 / DETR_onnx_tensorRT_V2
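DETR tensor: removes the auxiliary heads that are unused during inference, adds fp16 deployment for a further speedup, and offers a new method for fixing all-zero outputs after conversion to tensorrt.
☆12 · Jan 9, 2024 · Updated 2 years ago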
ankraft / ithoughtsx-styles
Collection of styles for the iThoughtsX mind mapper
☆16 · Jun 21, 2019 · Updated 6 years ago
m-kazuki / AuxIVA
☆11 · May 30, 2019 · Updated 6 years ago
GuillaumeVW / NSNet
This is an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhanc…
☆40 · Aug 20, 2020 · Updated 5 years ago
Toshiba-China-RDC / dcase20_task4
Couple learning on baseline of DCASE 2020 task 4
☆25 · Mar 9, 2022 · Updated 3 years ago
shizhengLi / qlib-learning
A systematic learning tutorial based on Microsoft's open-source AI quantitative investment platform
☆27 · Dec 7, 2025 · Updated 2 months ago
smores56 / osprette-v3
34-key unibody columnar keyboard with pinky clusters
☆10 · May 2, 2025 · Updated 10 months ago
mukyuuhate / SoundLocation
Sound source localization system based on the pynq-z2
☆14 · Nov 15, 2020 · Updated 5 years ago
sdeva14 / eusipco17-drone-sound-detection
Implementation of Empirical Study of Drone Sound Detection in Real-Life Environment with Deep Neural Networks, published in EUSIPCO17
☆12 · Sep 6, 2021 · Updated 4 years ago
gfreezy / alfred-docsrs
Alfred workflow to search docs.rs
☆12 · Dec 19, 2019 · Updated 6 years ago
synxlin / chinese-speech-recognition
This is a Chinese version of DeepSpeech2 in torch and its application. Modified from https://github.com/SeanNaren/deepspeech.torch.
☆13 · Jul 10, 2024 · Updated last year
Lmy0217 / PyTorch-aarch64
PyTorch wheel for installation on aarch64 and arm64 devices
☆12 · May 15, 2020 · Updated 5 years ago
OpenEPaperLink / Tag_FW_EFR32xG22
Firmware for EFR32xG22-based tags
☆13 · Updated this week
sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14 · Jun 15, 2019 · Updated 6 years ago
Yuanbo2020 / Audio-Visual-VAD
☆13 · May 9, 2022 · Updated 3 years ago
victorpanitz / ShepherdScroll
Shepherd Scroll implements a custom scroll view that makes it easy to animate child view controllers during scrolling.
☆15 · Dec 8, 2020 · Updated 5 years ago
EagleVee / keyboards
☆17 · Dec 5, 2024 · Updated last year
weiran / watch-it-later
Watch videos saved on Instapaper on your Apple TV.
☆12 · Dec 4, 2024 · Updated last year
Reletiv / OpenEPaperLink_TLSR
Alpha Test repo of ATC_TLSR_OpenEPaperLink
☆13 · Jul 25, 2024 · Updated last year
IoBT-VISTEC / Machine-Learning-for-BCI
CCA for SSVEP, DNN for SSVEP, ROS-BCI
☆16 · Jun 2, 2021 · Updated 4 years ago
ex3ndr / bubble-firmware
Open Firmware for AI Wearables
☆15 · May 12, 2024 · Updated last year
hongfeixue / KWS_pytorch
Keyword spotting and speech wake-up in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM
☆53 · Mar 15, 2022 · Updated 3 years ago
theowenyoung / dotfiles
dotfiles, init scripts, common scripts
☆18 · Jan 13, 2024 · Updated 2 years ago
LeonWlw / asr_blockformer
E2E ASR system
☆14 · Oct 20, 2022 · Updated 3 years ago
retsimx / tlsr8266_mesh
Collection of tools and code for reprogramming Tuya BLE smart light (TYBT1) boards based on the TLSR8266 MCU with new Rust firmware.
☆16 · Nov 6, 2025 · Updated 3 months ago
ex3ndr / datasets
Declare your datasets and download them using a simple tool
☆14 · Aug 2, 2024 · Updated last year