lovemefan/Silero-vad-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lovemefan/Silero-vad-pytorch)

lovemefan / Silero-vad-pytorch

silero-vad pytorch implement

☆38

Alternatives and similar repositories for Silero-vad-pytorch

Users that are interested in Silero-vad-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xmos / fwk_voice
View on GitHub
Voice Framework
☆18Jan 21, 2026Updated 6 months ago
joonaskalda / PixIT
View on GitHub
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105Jan 10, 2025Updated last year
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
zhuzizyf / damo-fsmn-vad-infer-httpserver
View on GitHub
达摩fsmn vad c++推理服务
☆17Apr 17, 2023Updated 3 years ago
ZhaoF-i / SDAEC
View on GitHub
☆19Jan 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
StellanLi / EchoFree
View on GitHub
☆18Feb 22, 2025Updated last year
Clovermax / AED-TSVAD
View on GitHub
Attention-Based Encoder-Decoder Target-Speaker Voice Activity Detection for Robust Speaker Diarization
☆31Sep 22, 2025Updated 10 months ago
daihuangyu / speex_aec_kf
View on GitHub
speex aec kalman filter
☆15Mar 17, 2024Updated 2 years ago
ThomasHaubner / e2e_dnn_ad_control_for_lin_aec
View on GitHub
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
☆45Nov 17, 2023Updated 2 years ago
ZhaoF-i / ASTWS-AEC
View on GitHub
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
☆31Nov 12, 2025Updated 8 months ago
vackva / Orbe
View on GitHub
Binaural Spatializer Audio Plugin
☆25Jun 25, 2024Updated 2 years ago
mohit-nith / GeneralizedWOLA-SystemIdentification
View on GitHub
Subband system identification using generalized Weighted Overlap-Add (WOLA) filter bank for improved acoustic echo cancellation.
☆15May 8, 2025Updated last year
merlresearch / tf-locoformer
View on GitHub
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133Aug 8, 2025Updated 11 months ago
taishi-n / torchrir
View on GitHub
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆23Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Okrio / deepvqe
View on GitHub
☆14Oct 12, 2023Updated 2 years ago
andyye1999 / Daily-study-notes
View on GitHub
每日学习笔记
☆69Dec 12, 2025Updated 7 months ago
xmos / sln_voice
View on GitHub
XCORE-VOICE Solution
☆20Apr 8, 2026Updated 3 months ago
Jokejiangv / LABNet
View on GitHub
The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…
☆49Oct 10, 2025Updated 9 months ago
rrbluke / NRES
View on GitHub
Neural Residual Echo Suppressor
☆51Aug 16, 2021Updated 4 years ago
Taltt / FNSE-SAT
View on GitHub
☆46Jan 14, 2025Updated last year
nttcslab-sp / mamba-diarization
View on GitHub
Official repository for Mamba-based Segmentation Model for Speaker Diarization
☆47May 13, 2025Updated last year
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HolgerBovbjerg / SSL-PVAD
View on GitHub
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…
☆25Nov 25, 2024Updated last year
audiolabs / MonteCarloRIRSimulation
View on GitHub
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18Feb 25, 2026Updated 5 months ago
RapidAI / RapidSpeech.cpp
View on GitHub
On-device speech AI runtime for ASR, TTS, VAD, and voice cloning. Python-simple, C++-native, GGUF-powered.
☆22Jul 15, 2026Updated 2 weeks ago
NiniAndy / Paraformer-V2
View on GitHub
来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
☆29Nov 20, 2024Updated last year
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
xiaochunxin / OMLSA-MCRA
View on GitHub
C++ speech enhancement base on OMLSA-MCRA
☆63Aug 4, 2020Updated 5 years ago
leospark / FireRedVAD-Engineering
View on GitHub
Lightweight streaming Voice Activity Detection (VAD) tool with ONNX runtime
☆24Mar 18, 2026Updated 4 months ago
875441459 / Design_DMA
View on GitHub
An implementation of frequency-invariant beamformer
☆14Sep 3, 2021Updated 4 years ago
hshi-speech / Research-and-Analysis-of-Speech-Enhancement-or-Dereverberation
View on GitHub
This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further u…
☆47Jul 6, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
William1617 / REAL_TIME_NKF_AEC
View on GitHub
☆24Jul 29, 2024Updated 2 years ago
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆41Jun 7, 2025Updated last year
dpwe / pitchfilter
View on GitHub
Speech enhancement by time-varying pitch-dependent filtering of harmonics
☆27Jul 3, 2014Updated 12 years ago
nanless / universal-speech-enhancement
View on GitHub
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83Jul 29, 2024Updated 2 years ago
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago