nanless / universal-speech-enhancement
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆61Updated 8 months ago
Alternatives and similar repositories for universal-speech-enhancement:
Users that are interested in universal-speech-enhancement are comparing it to the libraries listed below
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆47Updated last week
- ☆47Updated 7 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆42Updated last week
- ☆65Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- ☆49Updated 2 years ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆37Updated 4 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆32Updated 8 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 3 months ago
- ☆42Updated last year
- This is the official implementation of the LiSenNet☆81Updated 5 months ago
- ☆48Updated 3 weeks ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆39Updated 8 months ago
- Implementation of SpatialCodec.☆56Updated last year
- ☆13Updated 5 months ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 7 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated last month
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆57Updated 2 weeks ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆32Updated last year
- ☆26Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆93Updated 7 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆38Updated last month
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 8 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆58Updated 8 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆36Updated 5 months ago
- ☆69Updated 2 years ago
- real-time speech enhance☆14Updated last year