rhasspy / pysilero-vad

Mike/Projects/pysilero-vad.git

☆19

Alternatives and similar repositories for pysilero-vad

Users who are interested in pysilero-vad are comparing it to the libraries listed below.

dobby-seo / Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2.
☆108Updated 2 years ago
aizhiqi-work / MM-KWS
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆35Updated 2 months ago
Wadaboa / titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
☆64Updated 2 years ago
vineeths96 / Spoken-Keyword-Spotting
In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…
☆102Updated 2 years ago
bigcash / awesome-vad
A curated list of awesome voice activity detection
☆59Updated 8 months ago
pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆92Updated 7 months ago
zhenghuatan / rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…
☆142Updated 2 months ago
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57Updated last year
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
☆174Updated last year
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆88Updated last year
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆29Updated 5 months ago
espnet / espnet_onnx
ONNX wrapper for espnet inference model
☆168Updated 10 months ago
rishikksh20 / LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
☆92Updated 3 years ago
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆184Updated 2 weeks ago
shangeth / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
☆66Updated 4 years ago
pirxus / personalVAD
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
☆70Updated 2 years ago
NickWilkinson37 / voxseg
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
☆88Updated 2 years ago
csukuangfj / kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
☆113Updated 3 weeks ago
skit-ai / SpeechLLM
This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…
☆115Updated last year
mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆32Updated 9 months ago
pengzhendong / pysilero
Python Wrapper of Silero VAD
☆57Updated 3 months ago
k2-fsa / colab
Colab notebooks for Next-gen Kaldi
☆28Updated 4 months ago
kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆34Updated last year
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation in your audio files
☆156Updated last month
yl4579 / PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆260Updated 6 months ago
TeaPoly / speexdsp-ns-python
Python bindings of speexdsp noise suppression library
☆40Updated 2 years ago
VoxBlink2 / ScriptsForVoxBlink2
Official Repository For VoxBlink2
☆76Updated 11 months ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆54Updated 2 years ago
harvard-edge / multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆176Updated 8 months ago
k2-fsa / next-gen-kaldi-wechat
☆38Updated last year