CaA23187/VAD-based-on-LSTM

An LSTM for voice activity detection. In fact, this is a homework assignment that I didn't expect.

☆13

Alternatives and similar repositories for VAD-based-on-LSTM

Users who are interested in VAD-based-on-LSTM are comparing it to the repositories listed below.

Yifei-ZHAO96 / STAM-pytorch
PyTorch implementation of "spectro-temporal attention-based voice activity detection"
☆13 · Jun 4, 2024 · Updated 2 years ago
skgusrb12 / voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆27 · Mar 20, 2021 · Updated 5 years ago
CaA23187 / GCCRN_full
A PyTorch implementation of GCCRN
☆14 · Dec 18, 2021 · Updated 4 years ago
jymsuper / VAD_tutorial
Simple DNN-based Voice Activity Detection (VAD) using PyTorch
☆43 · Feb 8, 2020 · Updated 6 years ago
NjuHaoZhang / ConvLSTM-AE_VAD_ICME2017
ConvLSTM-AE_VAD_ICME2017 (code reimplementation)
☆21 · Oct 10, 2020 · Updated 5 years ago
StellanLi / EchoFree
☆18 · Feb 22, 2025 · Updated last year
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23 · Oct 19, 2023 · Updated 2 years ago
eran-shahar / Double-talk-Detection-aided-Residual-Echo-Suppression-via-Spectrogram-Masking-and-Refinement
☆29 · Jul 9, 2022 · Updated 4 years ago
taishi-n / torchrir
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes and GPU acceleration.
☆23 · Updated this week
CharlesThaCat / acoustic-interference-cancellation
Acoustic interference (echo) cancellation project from a summer internship
☆91 · Aug 31, 2018 · Updated 7 years ago
bagustris / w2v2-vad
A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition
☆22 · Aug 9, 2023 · Updated 2 years ago
thgpddl / TensorFlowLiteEmotionDemo
Run a tflite facial expression recognition model on Android
☆12 · Apr 7, 2021 · Updated 5 years ago
qiu931110 / RepDistiller
[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods
☆10 · Jan 12, 2020 · Updated 6 years ago
vipchengrui / MASG
Microphone array speech generator (MASG) for room acoustics
☆39 · Jan 2, 2020 · Updated 6 years ago
sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14 · Jun 15, 2019 · Updated 7 years ago
alvarobartt / covid-daily
🦠 COVID-19 Daily Data from Worldometers with Python
☆13 · Feb 28, 2021 · Updated 5 years ago
voithru / voice-activity-detection
PyTorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆159 · Oct 26, 2021 · Updated 4 years ago
wangkenpu / Adaptation-Interspeech18
Empirical Evaluation of Speaker Adaptation on DNN-based Acoustic Model
☆13 · Nov 25, 2019 · Updated 6 years ago
audiolabs / anf-generator
Generating non-stationary multi-sensor signals under a spatial coherence constraint (Python)
☆31 · Apr 12, 2026 · Updated 3 months ago
fuyufjh / tensorflow-and-deep-learning-chinese
TensorFlow and deep learning without a PhD, translated to Chinese
☆17 · Feb 18, 2017 · Updated 9 years ago
I-Man-H / DeepVADNet
☆13 · Jun 22, 2026 · Updated last month
Anwarvic / VAD_Benchmark
Benchmarking different VAD models on AVA-Speech dataset
☆19 · May 21, 2023 · Updated 3 years ago
sp-uhh / deep-non-linear-filter
☆82 · Feb 9, 2024 · Updated 2 years ago
RicherMans / Dcase2018_pooling
Repo for our pooling approach on the DCASE2018 task4
☆16 · Jul 6, 2023 · Updated 3 years ago
fclearner / Personal-vad-2.0
Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
☆16 · Jun 9, 2026 · Updated last month
TheKangChen / crosstalk-cancellation
Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.
☆12 · Sep 12, 2024 · Updated last year
amusi / awesome-semantic-segmentation
awesome-semantic-segmentation
☆11 · Jun 6, 2018 · Updated 8 years ago
AkojimaSLP / Neural-mask-estimation
☆46 · Dec 5, 2019 · Updated 6 years ago
jingdao / multiview_segmentation
☆11 · Nov 25, 2020 · Updated 5 years ago
Benjamin-Tsui / HRTF_preprocessing
HRTF data preparation for machine learning by finding common measurement angles
☆12 · May 14, 2019 · Updated 7 years ago
yoongi43 / music_source_separation
☆14 · Jan 12, 2023 · Updated 3 years ago
VikasTokala / BCCTN
☆33 · Jun 10, 2025 · Updated last year
YoungJay0612 / Speech-Simulation-Tools
A collection of data simulation tools and methods for speech enhancement (continuously updated)
☆45 · Jul 11, 2024 · Updated 2 years ago
Andong-Li-speech / TaylorBeamformer
The implementation of TaylorBeamformer, which is in submission to Interspeech 2022
☆49 · Jun 10, 2022 · Updated 4 years ago
robin1001 / nn-vad
Simple DNN-based VAD
☆69 · Dec 2, 2018 · Updated 7 years ago
Dahan-Wang / Rethinking-Flow-and-Diffusion-Bridge-Models-for-Speech-Enhancement
☆39 · Feb 23, 2026 · Updated 5 months ago
CaA23187 / bike-fitting
A small cloud-based bike fitting calculator; the calculation method comes from Weibo user @摸发虱痒
☆11 · Apr 10, 2022 · Updated 4 years ago
changxuding / Residual_Echo_Cancellation
Various algorithms for residual echo cancellation
☆32 · Jul 6, 2023 · Updated 3 years ago
luan78zaoha / kaldi-timit-sre-ivector
Develops a speaker recognition model based on i-vectors using the TIMIT database
☆16 · Jul 4, 2019 · Updated 7 years ago