SungFeng-Huang / SSL-pretraining-separationLinks
Official repository of our paper: https://arxiv.org/abs/2010.15366
☆63Updated 4 years ago
Alternatives and similar repositories for SSL-pretraining-separation
Users that are interested in SSL-pretraining-separation are comparing it to the libraries listed below
Sorting:
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆119Updated 2 years ago
- Libri-CSS: dataset and evaluation pipeline☆151Updated 2 years ago
- SpEx+(tied) source code☆89Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆123Updated 3 years ago
- ☆114Updated 5 years ago
- ☆132Updated 4 years ago
- ☆55Updated 3 years ago
- Conferencing Speech Challenge☆95Updated 4 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆77Updated 4 years ago
- A simple package for Guided source separation (GSS)☆132Updated last year
- STOI loss function in PyTorch☆103Updated last year
- DCCRN with various loss functions☆103Updated 3 years ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆146Updated 2 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 11 months ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)☆76Updated 5 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆82Updated 4 years ago
- Speech separation with utterance-level PIT experiments☆105Updated 7 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆94Updated 2 years ago
- transformer based neural network for speech enhancement in time domain☆76Updated 3 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆125Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- ☆37Updated 4 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 3 years ago
- multi-scale time domain speaker extraction☆71Updated 4 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆45Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 6 years ago
- Speech Separation☆78Updated last year
- Beam-guided TasNet☆57Updated 3 years ago
- ☆51Updated 4 years ago
- Training data simulation☆58Updated last year