zhenghuatan / rVADLinks

Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

☆136

Alternatives and similar repositories for rVAD

Users that are interested in rVAD are comparing it to the libraries listed below

Sorting:

zhenghuatan / rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…
☆142Updated last month
chenzhuo1011 / libri_css
Libri-CSS: dataset and evaluation pipeline
☆147Updated 2 years ago
xuchenglin28 / speaker_extraction
target speaker extraction and verification for multi-talker speech
☆180Updated 4 years ago
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆143Updated 2 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆122Updated 3 years ago
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆108Updated 2 years ago
fgnt / pb_chime5
Speech enhancement system for the CHiME-5 dinner party scenario
☆109Updated 5 months ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆103Updated 9 years ago
desh2608 / gss
A simple package for Guided source separation (GSS)
☆126Updated last year
ConferencingSpeech / ConferencingSpeech2021
Conferencing Speech Challenge
☆96Updated 4 years ago
desh2608 / dover-lap
Python package for combining diarization system outputs.
☆88Updated last year
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆77Updated 2 years ago
SungFeng-Huang / SSL-pretraining-separation
Official repository of our paper: https://arxiv.org/abs/2010.15366
☆63Updated 3 years ago
BUTSpeechFIT / VBx
Variational Bayes HMM over x-vectors diarization
☆273Updated last year
dobby-seo / Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆108Updated 2 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆121Updated 3 years ago
speechLabBcCuny / onssen
An open-source speech separation and enhancement library
☆213Updated 5 years ago
yuguochencuc / DB-AIAT
The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"
☆121Updated 3 years ago
BUTSpeechFIT / speakerbeam
☆123Updated 3 years ago
dodohow1011 / TS-VAD
☆50Updated 4 years ago
yluo42 / TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆281Updated 4 years ago
tencent-ailab / FRA-RIR
☆198Updated last year
NickWilkinson37 / voxseg
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
☆88Updated 2 years ago
huyanxin / phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
☆232Updated last year
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
☆416Updated 2 years ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
☆155Updated last month
BUTSpeechFIT / AMI-diarization-setup
☆54Updated last year
mpariente / pytorch_stoi
STOI loss function in PyTorch
☆92Updated 10 months ago
funcwj / conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…
☆212Updated 2 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99Updated 5 years ago