nanless / universal-speech-enhancement
Apply score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆63Updated last year
Alternatives and similar repositories for universal-speech-enhancement
Users who are interested in universal-speech-enhancement are comparing it to the libraries listed below.
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆69Updated 2 weeks ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆47Updated 4 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆82Updated 4 months ago
- ☆49Updated 11 months ago
- ☆54Updated 2 years ago
- ☆65Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated last year
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆63Updated 2 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆39Updated 3 months ago
- Unofficial PyTorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆35Updated last year
- ☆48Updated 4 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆69Updated last year
- Implementation of SpatialCodec.☆59Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆48Updated 2 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆87Updated 3 weeks ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated last week
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Production first, nn-based on-device signal processing toolkit.☆64Updated 2 years ago
- ☆63Updated 2 years ago
- ☆26Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆104Updated 11 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆41Updated 9 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆18Updated this week
- Spherical residual vector quantization (SRVQ)☆30Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆106Updated 2 months ago
- faster inference☆28Updated 7 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆80Updated 8 months ago