nanless / universal-speech-enhancementView external linksLinks
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆76Jul 29, 2024Updated last year
Alternatives and similar repositories for universal-speech-enhancement
Users that are interested in universal-speech-enhancement are comparing it to the libraries listed below
Sorting:
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Jul 31, 2024Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆102Mar 19, 2024Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- ☆52Sep 10, 2024Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Aug 7, 2024Updated last year
- ☆57Apr 24, 2024Updated last year
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆34Oct 11, 2025Updated 4 months ago
- ☆157Jan 30, 2024Updated 2 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- ☆120Apr 24, 2023Updated 2 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆249Dec 12, 2025Updated 2 months ago
- ☆21Jul 16, 2025Updated 6 months ago
- ☆46Jan 14, 2025Updated last year
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆129Mar 24, 2023Updated 2 years ago
- This is the official implementation of the LiSenNet☆146Nov 15, 2024Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆55Aug 15, 2025Updated 5 months ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆55Apr 14, 2025Updated 10 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆100May 24, 2023Updated 2 years ago
- ☆206Dec 5, 2024Updated last year
- ☆20Apr 27, 2024Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆93Sep 2, 2025Updated 5 months ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆68Jun 7, 2022Updated 3 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆96Apr 1, 2025Updated 10 months ago
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆88Jul 23, 2025Updated 6 months ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆72May 11, 2024Updated last year
- ☆10Jun 24, 2021Updated 4 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- ☆19Aug 25, 2025Updated 5 months ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 8 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆209Jun 25, 2024Updated last year
- An efficient speech separation method☆297Apr 11, 2024Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated 11 months ago