robin1001/vad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/robin1001/vad)

robin1001 / vad

simple energy vad

☆19

Alternatives and similar repositories for vad

Users that are interested in vad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenet-e2e / WeSpeech-AI
View on GitHub
Open Source Speech/Text Data on AI
☆19Sep 13, 2022Updated 3 years ago
robin1001 / nn-vad
View on GitHub
simple dnn based vad
☆69Dec 2, 2018Updated 7 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 2 months ago
robin1001 / webrtcvad
View on GitHub
vad wraper on webrtcvad
☆25Jun 3, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kaustubh-iamplus / webrtc_vad
View on GitHub
Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine
☆11Mar 1, 2018Updated 8 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 10 months ago
TUT-ARG / DCASE2016-baseline-system-matlab
View on GitHub
☆13Jan 10, 2017Updated 9 years ago
robin1001 / kaldi-aslp
View on GitHub
☆43Jun 25, 2018Updated 8 years ago
vinusankars / ESOLA
View on GitHub
Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.
☆23Jul 24, 2020Updated 5 years ago
mmorise / tusk
View on GitHub
A framework for overviewing the performance of F0 estimators
☆19Sep 10, 2016Updated 9 years ago
BYRTIMO / END-TO-END-SPEECH-ENHANCEMENT-BASED-ON-DISCRETE-COSINE-TRANSFORM
View on GitHub
☆18Nov 10, 2019Updated 6 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
pengzhendong / welm
View on GitHub
One command to build TLG.fst for WeNet.
☆30Oct 11, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
idiap / apam
View on GitHub
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…
☆14Feb 15, 2021Updated 5 years ago
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
galv / galvASR
View on GitHub
ASR library
☆14Dec 3, 2018Updated 7 years ago
IMLHF / WFb_SE
View on GitHub
(tensorflow) Wiener Filter based Speech Enhancement（LSTM/BLSTM, GRU/BGRU, Transformer）
☆15Dec 3, 2019Updated 6 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
PengdaLiu / LAS-SpeechRecognition
View on GitHub
Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).
☆32Jun 27, 2019Updated 7 years ago
wenet-e2e / nn-singal-processing-papers
View on GitHub
List of NN based singal processing papers
☆23Jun 5, 2023Updated 3 years ago
jefflai108 / Attentive-Filtering-Network
View on GitHub
University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.
☆50May 1, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
naxingyu / kaldi_cvte_model_test
View on GitHub
This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)
☆15May 30, 2019Updated 7 years ago
uhh-lt / MeetingBot
View on GitHub
Minute Meeting Bot
☆20Mar 4, 2023Updated 3 years ago
MycroftAI / pylisten
View on GitHub
A simple pyaudio microphone interface
☆11Jul 27, 2018Updated 7 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
pengzhendong / torchfa
View on GitHub
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61Sep 5, 2025Updated 10 months ago
Debapriya-Tula / Speech_Dereverberation
View on GitHub
Speech Dereverberation using weighted prediction error
☆11Dec 22, 2019Updated 6 years ago
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
placebokkk / e6870
View on GitHub
assignments for e6870 ASR class
☆42Apr 23, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
Mashiro009 / wenet-onnx
View on GitHub
☆33Aug 6, 2021Updated 4 years ago
Mddct / WeUSM
View on GitHub
☆13Mar 30, 2023Updated 3 years ago
SergMa / free-nross
View on GitHub
Free noise reduction of speech signals
☆12Jul 26, 2016Updated 9 years ago
Tzenthin / wenet_mnn
View on GitHub
语音识别模型pytorch转ONNX转MNN，C++实现部署
☆85Sep 1, 2022Updated 3 years ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
Hguimaraes / SEWUNet
View on GitHub
[Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)
☆32Nov 22, 2022Updated 3 years ago