HolgerBovbjerg / SSL-PVAD
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"
☆11Updated 5 months ago
Alternatives and similar repositories for SSL-PVAD
Users that are interested in SSL-PVAD are comparing it to the libraries listed below
Sorting:
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Toolbox for Evaluation of AEC/AES Systems☆19Updated this week
- ☆20Updated 7 months ago
- The implementation of MDNet, which is in submission to Interspeech2022☆13Updated 3 years ago
- ☆11Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- ☆21Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 3 months ago
- offical code for Dense-TSNet☆12Updated 8 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Updated 2 months ago
- ☆26Updated 2 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Updated 2 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated 3 months ago
- ☆11Updated 2 months ago
- ☆25Updated 2 years ago
- ☆13Updated last month
- ☆12Updated last month
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆40Updated 2 years ago
- Dataset simulation for DPCCN.☆15Updated 2 years ago
- ☆13Updated last year
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Updated 2 years ago
- eran-shahar / Double-talk-Detection-aided-Residual-Echo-Suppression-via-Spectrogram-Masking-and-Refinement☆26Updated 2 years ago
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- ☆10Updated 7 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆16Updated last month
- Neural network density models for speech separation.☆20Updated 4 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Official repository for LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Att…☆16Updated 2 months ago