wxqwinner / silero-vad-ncnnView external linksLinks
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆24Aug 21, 2024Updated last year
Alternatives and similar repositories for silero-vad-ncnn
Users that are interested in silero-vad-ncnn are comparing it to the libraries listed below
Sorting:
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Pure C# port of the Pocketsphinx keyword spotter☆13Jan 19, 2020Updated 6 years ago
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 2 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- ☆16Apr 24, 2025Updated 9 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆23Jul 17, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆43Mar 9, 2022Updated 3 years ago
- 达摩fsmn vad c++推理服务☆18Apr 17, 2023Updated 2 years ago
- SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖 ,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synth…☆22Sep 5, 2024Updated last year
- Real-Time De-noising and De-reverbing with Tiny Recurrent UNet☆54Jun 7, 2023Updated 2 years ago
- Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、ag…☆47Jul 17, 2024Updated last year
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆154Aug 9, 2025Updated 6 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- A toolkit dedicate for speech evaluation.☆24Sep 26, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- Source code for Consistent ensemble distillation for audio tagging☆56Jun 12, 2025Updated 8 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Podcast Summarizer with LLM Technology☆30May 28, 2025Updated 8 months ago
- ☆81Jun 25, 2025Updated 7 months ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆29Feb 4, 2025Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Jul 16, 2024Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆68Jul 19, 2025Updated 6 months ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆561Jan 18, 2026Updated last month
- RNNOISE Noise elimination, MCRA noise estimation, OMLSA post filtering☆30Jan 23, 2023Updated 3 years ago
- Multi-speaker separation, identification, diarization ALL-IN-ONE. It can isolate the target speaker from a conversation audio and do ASR.☆61Oct 13, 2025Updated 4 months ago