rhasspy / pysilero-vad
Mike/Projects/pysilero-vad.git
☆22Updated this week
Alternatives and similar repositories for pysilero-vad
Users who are interested in pysilero-vad are comparing it to the libraries listed below.
- Python bindings of speexdsp noise suppression library☆44Updated 3 years ago
- Python Wrapper of Silero VAD☆62Updated 7 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆103Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆41Updated 7 months ago
- A curated list of awesome voice activity detection☆69Updated last year
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆124Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆58Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆97Updated 11 months ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆120Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆30Updated 2 months ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2.☆109Updated 2 years ago
- ☆20Updated 3 months ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 4 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆77Updated 3 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆37Updated 7 months ago
- ViSpeR: Multilingual Audio-Visual Speech Recognition☆54Updated 7 months ago
- ☆72Updated 2 months ago
- CTC decoder with hotwords for ASR.☆34Updated 8 months ago
- Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating"☆36Updated 6 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆91Updated 2 years ago
- Utilizes ONNX Runtime for audio denoising.☆97Updated last month
- WeNet online decode demo☆31Updated 4 years ago
- ONNX wrapper for ESPnet inference model☆169Updated 4 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆99Updated last year
- Voice activity engine benchmark framework☆21Updated 2 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆78Updated 3 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆67Updated 3 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆29Updated 9 months ago