nanless / universal-speech-enhancement
Applies score-based diffusion to enhance speech signals recorded under adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆58Updated 8 months ago
Alternatives and similar repositories for universal-speech-enhancement:
Users who are interested in universal-speech-enhancement are comparing it to the repositories listed below.
- ☆46Updated 6 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆45Updated last week
- ☆48Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆42Updated 3 months ago
- Official implementation of LiSenNet☆69Updated 4 months ago
- ☆65Updated last year
- ☆26Updated last year
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆37Updated 3 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 7 months ago
- Implementation of SpatialCodec.☆56Updated last year
- [ICASSP 2025] Dynamic Embedding Causal Target Speech Extraction☆2Updated 2 weeks ago
- Official repository of a new state-of-the-art diffusion-model-based method for speech enhancement☆38Updated 8 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 2 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆36Updated 3 weeks ago
- ☆42Updated 11 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆52Updated 2 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆68Updated 7 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆32Updated last year
- ☆25Updated 2 years ago
- ☆61Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- The official repo of UL-UNAS, an ultra-lightweight SE model.☆31Updated 3 weeks ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 6 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆65Updated 3 weeks ago
- Real-time speech enhancement☆13Updated last year
- ☆13Updated 4 months ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- ☆25Updated last year