SungFeng-Huang / SSL-pretraining-separation
Official repository of our paper: https://arxiv.org/abs/2010.15366
☆62 · Updated 3 years ago
Alternatives and similar repositories for SSL-pretraining-separation:
Users who are interested in SSL-pretraining-separation are comparing it to the repositories listed below:
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer. ☆116 · Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement" ☆121 · Updated 2 years ago
- Libri-CSS: dataset and evaluation pipeline ☆145 · Updated 2 years ago
- SpEx+(tied) source code ☆83 · Updated last year
- ☆110 · Updated 4 years ago
- Conferencing Speech Challenge ☆94 · Updated 4 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python. ☆58 · Updated 4 years ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation ☆103 · Updated 2 years ago
- STOI loss function in PyTorch ☆91 · Updated 7 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 · Updated 2 years ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL) ☆73 · Updated 4 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications ☆44 · Updated 3 years ago
- ☆50 · Updated 2 years ago
- ☆118 · Updated 3 years ago
- DCCRN with various loss functions ☆95 · Updated 2 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆112 · Updated 2 years ago
- Beam-guided TasNet ☆51 · Updated 2 years ago
- ☆190 · Updated last year
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement ☆78 · Updated 3 years ago
- ☆93 · Updated 4 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario ☆109 · Updated 3 months ago
- Implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" in PyTorch ☆51 · Updated 3 years ago
- ☆55 · Updated 11 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆45 · Updated 5 years ago
- A fast implementation of bss_eval metrics for blind source separation ☆135 · Updated 2 years ago
- PyTorch implementation of subband decomposition ☆92 · Updated 2 years ago
- Multi-scale time-domain speaker extraction ☆63 · Updated 3 years ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation. ☆142 · Updated last year
- ☆51 · Updated 3 years ago
- ☆41 · Updated 5 years ago