sp-uhh / sgmseLinks
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆645Updated last month
Alternatives and similar repositories for sgmse
Users that are interested in sgmse are comparing it to the libraries listed below
Sorting:
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆414Updated 3 months ago
- Conformer-based Metric GAN for speech enhancement☆374Updated last year
- An open source dataset for source separation☆439Updated last year
- Conditional Diffusion Probabilistic Model for Speech Enhancement☆242Updated 2 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆232Updated 11 months ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆327Updated last year
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.☆593Updated last year
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."☆574Updated 2 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆375Updated last year
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆266Updated 3 weeks ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆331Updated 2 years ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆336Updated 2 years ago
- ☆443Updated last year
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆217Updated 4 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆468Updated last year
- see README☆356Updated last month
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆293Updated 7 months ago
- This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf☆399Updated 3 years ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆485Updated 5 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆210Updated 2 months ago
- Speaker embedding (d-vector) trained with GE2E loss☆284Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆995Updated 2 years ago
- PPG-Based Voice Conversion☆344Updated 3 years ago
- Python implementation of performance metrics in Loizou's Speech Enhancement book☆431Updated 6 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆408Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆680Updated 7 months ago
- General Speech Restoration☆282Updated last year
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆724Updated 2 years ago
- Keep track of big models in audio domain, including speech, singing, music etc.☆492Updated 10 months ago
- PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)☆601Updated 11 months ago