HolgerBovbjerg/SSL-PVAD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HolgerBovbjerg/SSL-PVAD)

HolgerBovbjerg / SSL-PVAD

A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"

☆25

Alternatives and similar repositories for SSL-PVAD

Users that are interested in SSL-PVAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fireredchat-submodules / livekit-plugins-fireredchat-pvad
View on GitHub
FireRedChat pVAD plugin for LiveKit Agents
☆22Sep 16, 2025Updated 10 months ago
fclearner / Personal-vad-2.0
View on GitHub
Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
☆16Jun 9, 2026Updated last month
ddxsg24 / Personalized-Speech-Enhancement
View on GitHub
ASLP Summer Inter@NPU
☆13Jul 30, 2024Updated last year
REAL-TSE / wesep-real-tse
View on GitHub
☆36Apr 14, 2026Updated 3 months ago
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
mathieulagrange / ddspMusicBandwidthExtension
View on GitHub
☆17Sep 19, 2023Updated 2 years ago
pirxus / personalVAD
View on GitHub
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
☆90Sep 22, 2022Updated 3 years ago
tan90xx / distillw2n
View on GitHub
🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features
☆25Dec 10, 2025Updated 7 months ago
Maokui-He / NSD-MA-MSE
View on GitHub
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆62Sep 19, 2024Updated last year
Xiaobin-Rong / deepvqe
View on GitHub
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
☆148Mar 24, 2025Updated last year
NikolaiKyhne / xLSTM-SENet
View on GitHub
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
☆60Aug 28, 2025Updated 11 months ago
miemiekurisu / qwen3asr_cpu
View on GitHub
A high-performance C/C++ inference server for Qwen3-ASR , optimized for CPU/GPU real-time streaming speech recognition.
☆15Jun 27, 2026Updated last month
Xiaobin-Rong / TRT-SE
View on GitHub
An example of a speech enhancement model deployed with TensorRT.
☆88Mar 24, 2025Updated last year
JethroWangSir / SincQDR-VAD
View on GitHub
☆26Aug 29, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
leospark / FireRedVAD-Engineering
View on GitHub
Lightweight streaming Voice Activity Detection (VAD) tool with ONNX runtime
☆24Mar 18, 2026Updated 4 months ago
ZhaoF-i / SDAEC
View on GitHub
☆19Jan 6, 2025Updated last year
lovemefan / Silero-vad-pytorch
View on GitHub
silero-vad pytorch implement
☆38Nov 23, 2024Updated last year
changxuding / Residual_Echo_Cancellation
View on GitHub
Various Algorithm for Residual Echo Cancellation
☆32Jul 6, 2023Updated 3 years ago
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
yu-haoyuan / fd-badcat
View on GitHub
fd-sds
☆21Apr 8, 2026Updated 3 months ago
narrietal / Fast-ULCNet
View on GitHub
Official repository of Fast-ULCNet.
☆39Jun 17, 2026Updated last month
Clovermax / AED-TSVAD
View on GitHub
Attention-Based Encoder-Decoder Target-Speaker Voice Activity Detection for Robust Speaker Diarization
☆31Sep 22, 2025Updated 10 months ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
DavidDiazGuerra / Cross3D
View on GitHub
Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
☆90Mar 24, 2023Updated 3 years ago
VikasTokala / BCCTN
View on GitHub
☆33Jun 10, 2025Updated last year
chentuochao / Target-Conversation-Extraction
View on GitHub
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58Aug 15, 2025Updated 11 months ago
isHuangZiling / SEF-PNet
View on GitHub
☆24Jul 10, 2025Updated last year
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆41Jun 7, 2025Updated last year
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
ncsoft / PhonMatchNet
View on GitHub
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆63Jun 3, 2024Updated 2 years ago
ceva-ip / DPDFNet
View on GitHub
Clean up noisy speech in real time with DPDFNet - open-source streaming speech enhancement for research, audio apps, and edge devices. In…
☆113Jul 22, 2026Updated last week
Dahan-Wang / Rethinking-Flow-and-Diffusion-Bridge-Models-for-Speech-Enhancement
View on GitHub
☆39Feb 23, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Yifei-ZHAO96 / Tr-VAD
View on GitHub
Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
☆18Aug 1, 2024Updated last year
Jerry-jwz / Audio-Enhancement-via-ONMF
View on GitHub
☆23Feb 2, 2022Updated 4 years ago
liyunlongaaa / NSD-MS2S
View on GitHub
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…
☆88Jun 17, 2025Updated last year
gxu82 / MVDR-Speech-Enhancement
View on GitHub
☆16Jul 14, 2020Updated 6 years ago
taotaowang97479 / MFNet-SpeechEnhancement
View on GitHub
This is the unofficial implementation of MFNet, from paper''a Mask Free Neural Network for Monaural Speech Enhancement''
☆13Dec 20, 2024Updated last year
Max1Wz / H-GTCRN
View on GitHub
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111Mar 13, 2026Updated 4 months ago
ebezzam / room-simulation
View on GitHub
Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".
☆34Oct 27, 2020Updated 5 years ago