nanless / universal-speech-enhancement
Applies score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆63Updated 10 months ago
Alternatives and similar repositories for universal-speech-enhancement
Users who are interested in universal-speech-enhancement are comparing it to the repositories listed below.
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆60Updated this week
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆46Updated 2 months ago
- ☆48Updated 9 months ago
- ☆53Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 10 months ago
- ☆65Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆74Updated 2 months ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆3Updated 3 months ago
- ☆26Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- Implementation of SpatialCodec.☆58Updated last year
- ☆47Updated 2 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 10 months ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆41Updated 6 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆36Updated last month
- Official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Updated 10 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆39Updated 7 months ago
- Production first, nn-based on-device signal processing toolkit.☆65Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆68Updated 3 years ago
- faster inference☆28Updated 5 months ago
- ☆47Updated last year
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- Unofficial PyTorch Lightning implementation of "Real-time Speech Frequency Bandwidth Extension"☆34Updated last year
- ☆69Updated 2 years ago
- ☆20Updated last year
- Official repository for FlowSE (Interspeech 2025)☆18Updated last week
- ☆62Updated last year
- PyTorch models for speech enhancement☆21Updated 2 years ago