nanless / universal-speech-enhancementLinks
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆63Updated last year
Alternatives and similar repositories for universal-speech-enhancement
Users that are interested in universal-speech-enhancement are comparing it to the libraries listed below
Sorting:
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated 3 months ago
- ☆49Updated 10 months ago
- ☆54Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 4 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆68Updated this week
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆79Updated 4 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 11 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆37Updated 2 months ago
- Implementation of SpatialCodec.☆59Updated last year
- ☆65Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆35Updated last year
- ☆63Updated last year
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆69Updated 11 months ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆44Updated last month
- ☆47Updated 4 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 10 months ago
- ☆26Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆75Updated 2 months ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆69Updated 3 years ago
- ☆69Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆101Updated 11 months ago
- Spherical residual vector quantization (SRVQ)☆30Updated 11 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆40Updated 8 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆78Updated this week
- The implementation of MDNet, which is in submission to Interspeech2022☆13Updated 3 years ago