nanless / universal-speech-enhancement
Apply score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.
☆45 Updated 6 months ago
Alternatives and similar repositories for universal-speech-enhancement:
Users who are interested in universal-speech-enhancement are comparing it to the libraries listed below:
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆39 Updated last month
- ☆46 Updated 5 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆66 Updated last week
- Fully Quantized Neural Networks For Speech Enhancement ☆61 Updated last year
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement ☆36 Updated 2 months ago
- ☆26 Updated last year
- Real-time speech enhancement ☆12 Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆33 Updated 6 months ago
- ☆48 Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆95 Updated last year
- ☆64 Updated last year
- Production first, nn-based on-device signal processing toolkit. ☆64 Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆67 Updated 6 months ago
- This is the official implementation of the LiSenNet ☆55 Updated 3 months ago
- ☆12 Updated 3 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆86 Updated 5 months ago
- ☆25 Updated last year
- Unofficial PyTorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension" ☆31 Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆24 Updated 4 months ago
- ☆41 Updated 9 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆37 Updated 3 months ago
- ☆46 Updated 2 months ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech 2022 ☆40 Updated 2 years ago
- An example of a speech enhancement model deployed with TensorRT. ☆45 Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆45 Updated last month
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆32 Updated 7 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆49 Updated last month
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN ☆23 Updated 2 years ago
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement ☆38 Updated 6 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization ☆30 Updated 4 months ago