nanless / universal-speech-enhancementLinks
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆69Updated last year
Alternatives and similar repositories for universal-speech-enhancement
Users that are interested in universal-speech-enhancement are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆74Updated 3 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆50Updated 6 months ago
- ☆51Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆74Updated 5 months ago
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆71Updated 3 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆89Updated 7 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Updated last year
- ☆65Updated 2 years ago
- ☆54Updated 2 years ago
- Implementation of SpatialCodec.☆64Updated 2 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆95Updated 3 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆52Updated 2 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆65Updated 5 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆43Updated 5 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆108Updated last year
- ☆66Updated 2 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆71Updated 4 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆76Updated last year
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆45Updated 8 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆36Updated 3 weeks ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Updated last year
- ☆49Updated 7 months ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆38Updated 4 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆84Updated 5 months ago
- Exploring Binary Classification Loss for Speaker Verification☆18Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago