Silero VAD (ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆24Aug 21, 2024Updated last year
Alternatives and similar repositories for silero-vad-ncnn
Users that are interested in silero-vad-ncnn are comparing it to the libraries listed below
- From the paper Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- silero-vad PyTorch implementation☆36Nov 23, 2024Updated last year
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Pure C# port of the Pocketsphinx keyword spotter☆13Jan 19, 2020Updated 6 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 3 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- ☆16Apr 24, 2025Updated 10 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆23Jul 17, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆43Mar 9, 2022Updated 4 years ago
- SummerTTS is a standalone C++ speech synthesis project for Chinese and English. It runs locally without a network connection, has no extra dependencies, and is ready for Chinese and English speech synthesis after a one-step build.☆22Sep 5, 2024Updated last year
- DAMO FSMN VAD C++ inference service☆18Apr 17, 2023Updated 2 years ago
- Real-Time De-noising and De-reverbing with Tiny Recurrent UNet☆55Jun 7, 2023Updated 2 years ago
- Keep track of good articles on speech processing, mainly on speech enhancement, including speech denoising, speech dereverberation, and AEC, AG…☆47Jul 17, 2024Updated last year
- End-to-end wake-word toolkit, from model training to model inference.☆155Aug 9, 2025Updated 7 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 5 months ago
- A toolkit dedicated to speech evaluation.☆23Sep 26, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Podcast Summarizer with LLM Technology☆30May 28, 2025Updated 9 months ago
- ☆81Jun 25, 2025Updated 8 months ago
- Source code for Consistent ensemble distillation for audio tagging☆59Jun 12, 2025Updated 8 months ago
- The case study and multilingual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆29Feb 4, 2025Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Jul 16, 2024Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆69Jul 19, 2025Updated 7 months ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆27Apr 11, 2024Updated last year
- RNNOISE Noise elimination, MCRA noise estimation, OMLSA post filtering☆30Jan 23, 2023Updated 3 years ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆578Jan 18, 2026Updated last month