nanless/universal-speech-enhancement

Applies score-based diffusion to enhance speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec loss, bandwidth limitations, and other forms of degradation.

☆82

Alternatives and similar repositories for universal-speech-enhancement

Users who are interested in universal-speech-enhancement are comparing it to the libraries listed below.

zelokuo / VPIDM
This is the official repository of a new SOTA diffusion-model-based method for speech enhancement
☆43 · Jul 31, 2024 · Updated last year
Max1Wz / H-GTCRN
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111 · Mar 13, 2026 · Updated 4 months ago
cisco-open / pase
PASE: Phonologically Anchored Speech Enhancer
☆86 · Jul 15, 2026 · Updated last week
hyyan2k / PGUSE
This is the official implementation of PGUSE
☆41 · Jun 7, 2025 · Updated last year
gitwukeyi / FSPEN
☆59 · Apr 24, 2024 · Updated 2 years ago
yuguochencuc / BAE-Net
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
☆80 · Aug 20, 2024 · Updated last year
WingSingFung / TISDiSS
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16 · Oct 19, 2025 · Updated 9 months ago
Jokejiangv / LABNet
The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…
☆49 · Oct 10, 2025 · Updated 9 months ago
haoxiangsnr / spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆142 · Jan 28, 2026 · Updated 5 months ago
hyyan2k / LiSenNet
This is the official implementation of LiSenNet
☆162 · Nov 15, 2024 · Updated last year
aleXiehta / AD-FlowTSE
Adaptive Flow-Matching for Target Speaker Extraction
☆39 · Jul 13, 2026 · Updated last week
junyuchen-cjy / DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109 · Mar 19, 2024 · Updated 2 years ago
ASLP-lab / SenSE
Official code of SenSE.
☆90 · Oct 30, 2025 · Updated 8 months ago
IiuZiKai / Evo_TSE
☆17 · Apr 9, 2026 · Updated 3 months ago
ssi-research / FQSE
Fully Quantized Neural Networks For Speech Enhancement
☆65 · Feb 15, 2024 · Updated 2 years ago
Emrys365 / se-scaling
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41 · Aug 7, 2024 · Updated last year
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
sungwon23 / BSRNN
☆138 · Apr 24, 2023 · Updated 3 years ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
merlresearch / tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133 · Aug 8, 2025 · Updated 11 months ago
HaoFengyuan / X-TF-GridNet
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆114 · Sep 2, 2025 · Updated 10 months ago
seongq / flowmse
(ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement
☆107 · Jul 23, 2025 · Updated last year
liuhuang31 / HiFTNet-sr
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24 · Jan 2, 2024 · Updated 2 years ago
Honee-W / FlowSE
Official repository for FlowSE (Interspeech 2025)
☆111 · Jul 9, 2025 · Updated last year
seorim0 / SE-using-SRL-Model
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
☆21 · Jun 6, 2025 · Updated last year
Okrio / FSPEN
☆21 · Apr 27, 2024 · Updated 2 years ago
taishi-n / torchrir
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes and GPU acceleration.
☆22 · Updated this week
Audio-WestlakeU / McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130 · Mar 24, 2023 · Updated 3 years ago
sp-uhh / ears_benchmark
Generation scripts for EARS-WHAM and EARS-Reverb
☆48 · Jul 4, 2025 · Updated last year
facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆221 · Jun 25, 2024 · Updated 2 years ago
Kevin-naticl / LLaSE-G1
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆105 · Apr 1, 2025 · Updated last year
RoyChao19477 / SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆274 · Dec 12, 2025 · Updated 7 months ago
tencent-ailab / FRA-RIR
☆214 · Dec 4, 2023 · Updated 2 years ago
amanteur / BandSplitRNN-PyTorch
Unofficial PyTorch implementation of Music Source Separation with Band-split RNN
☆191 · Jun 10, 2024 · Updated 2 years ago
cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement
☆161 · Jan 30, 2024 · Updated 2 years ago
viewfinder-annn / AnyEnhance-v1
AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1
☆64 · May 21, 2026 · Updated 2 months ago
zeroone-universe / RealTimeBWE
Unofficial PyTorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
☆41 · Oct 20, 2025 · Updated 9 months ago
Xiaobin-Rong / ul-unas
The official repo of UL-UNAS, an ultra-lightweight SE model.
☆192 · Jun 17, 2026 · Updated last month