wxqwinner/silero-vad-ncnn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wxqwinner/silero-vad-ncnn)

wxqwinner / silero-vad-ncnn

Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.

☆26

Alternatives and similar repositories for silero-vad-ncnn

Users that are interested in silero-vad-ncnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wxqwinner / gtcrn-ncnn
View on GitHub
GTCRN(ncnn).
☆19May 22, 2025Updated last year
William1617 / gtcrn_c
View on GitHub
☆24Jul 17, 2024Updated 2 years ago
zhuzizyf / damo-fsmn-vad-infer-httpserver
View on GitHub
达摩fsmn vad c++推理服务
☆17Apr 17, 2023Updated 3 years ago
magicse / ncnn-hifi-GAN
View on GitHub
ncnn HiFi-GAN
☆30Sep 29, 2024Updated last year
ouyangkk / speech_enhancement_rnnoise_mcra
View on GitHub
RNNOISE Noise elimination, MCRA noise estimation, OMLSA post filtering
☆31Jan 23, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ctwgL / webrtc-beamforming
View on GitHub
整理出来的webrtc波束模块
☆40Apr 7, 2021Updated 5 years ago
Okrio / tinyrecurrentunet
View on GitHub
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
☆56Jun 7, 2023Updated 3 years ago
neonbjb / BigListOfPodcasts
View on GitHub
A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.
☆44Mar 9, 2022Updated 4 years ago
rickie-mi / Adaptive-Speech-Dereverberation-based-on-QR-MCLP-model
View on GitHub
根据论文《Multi-Channel Linear Prediction Speech Dereverberation Algorithm Based on QR-RLS Adaptive Filter》写了一个C代码去实现双通道语音去混响的功能
☆30Jan 3, 2022Updated 4 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
YoungJay0612 / Single-Channel-Speech-Enhancement
View on GitHub
Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、ag…
☆52Jul 17, 2024Updated 2 years ago
DakeQQ / Audio-Denoiser-ONNX
View on GitHub
Utilizes ONNX Runtime for audio denoising.
☆134Updated this week
jark006 / SummerTTS_VS
View on GitHub
SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目，可以本地运行不需要网络，而且没有额外的依赖，一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synth…
☆22Sep 5, 2024Updated last year
lovemefan / Silero-vad-pytorch
View on GitHub
silero-vad pytorch implement
☆38Nov 23, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ctwgL / webrtc_agc2
View on GitHub
demo for webrtc agc2
☆36Dec 25, 2021Updated 4 years ago
NiniAndy / Paraformer-V2
View on GitHub
来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
☆29Nov 20, 2024Updated last year
zhangsu / seal
View on GitHub
Scorched End Audio Library: a C library (with Ruby binding) for 3D audio rendering.
☆22Oct 12, 2014Updated 11 years ago
chenyangMl / keyword-spot
View on GitHub
端到端语音唤醒工具箱，从模型训练到模型推理。
☆164Jun 12, 2026Updated last month
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
DakeQQ / STFT-ISTFT-ONNX
View on GitHub
Export the STFT or ISTFT process in ONNX format.
☆47Jun 6, 2026Updated last month
yuzhouhe2000 / OMLSA-IMCRA
View on GitHub
Python implementation of OMLSA+IMCRA algorithm for speech enhancement.
☆70Jun 29, 2021Updated 5 years ago
helloooideeeeea / RealTimeCutVADCXXLibrary
View on GitHub
C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provide…
☆14Feb 19, 2026Updated 5 months ago
xiaochunxin / OMLSA-MCRA
View on GitHub
C++ speech enhancement base on OMLSA-MCRA
☆63Aug 4, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ml-inory / melotts.axera
View on GitHub
MeloTTS demo on Axera
☆14Jul 1, 2026Updated 3 weeks ago
cvqluu / MTL-Speaker-Embeddings
View on GitHub
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26Oct 5, 2022Updated 3 years ago
PINTO0309 / onnx-aec
View on GitHub
A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.
☆13Oct 22, 2024Updated last year
bglid / GTCRN-Micro
View on GitHub
Rebuild of GTCRN using Grouped TCNs, amidst other changes. Initially an attempt to target MCU deployment.
☆26Jan 12, 2026Updated 6 months ago
rpdswtk / vsmqtt
View on GitHub
VSMqtt is a simple MQTT client integrated in vscode.
☆17Updated this week
xFinal / nsfw-mobile-ncnn
View on GitHub
nsfw & porn detection for mobile in ncnn
☆12Dec 24, 2021Updated 4 years ago
Yifei-ZHAO96 / Tr-VAD
View on GitHub
Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
☆18Aug 1, 2024Updated last year
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
EdVince / PiDiNet-NCNN
View on GitHub
PiDiNet running in Android by ncnn
☆15Sep 26, 2021Updated 4 years ago
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Updated this week
spatialaudio / lf-corrected-kemar-hrtfs
View on GitHub
KEMAR HRTFs with low frequency correction
☆13Mar 20, 2017Updated 9 years ago
xcore / sw_audio_effects
View on GitHub
Audio effect components
☆12Apr 16, 2014Updated 12 years ago
ahmedshah1494 / speech_robust_bench
View on GitHub
☆18Apr 24, 2025Updated last year
miquelramirez / clam
View on GitHub
CLAM: C++ Library for Audio and Music (unofficial mirror)
☆10May 11, 2017Updated 9 years ago