nanless / universal-speech-enhancement
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆61Updated 8 months ago
Alternatives and similar repositories for universal-speech-enhancement:
Users that are interested in universal-speech-enhancement are comparing it to the libraries listed below
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆47Updated last week
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆42Updated last week
- ☆47Updated 7 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆32Updated 8 months ago
- ☆65Updated last year
- ☆49Updated 2 years ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆37Updated 4 months ago
- ☆26Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 3 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆39Updated 8 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 8 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated last month
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆69Updated 3 weeks ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 7 months ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- ☆42Updated last year
- Implementation of SpatialCodec.☆56Updated last year
- The official repo of UL-UNAS, an ultra-lightweight SE model.☆38Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆26Updated 4 months ago
- ☆18Updated last year
- ☆13Updated 5 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆54Updated 6 months ago
- This is the official implementation of the LiSenNet☆81Updated 5 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆38Updated last month
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆32Updated last year
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆40Updated 2 years ago
- ☆25Updated 2 years ago