Audio-WestlakeU/CleanMel

Audio-WestlakeU / CleanMel

PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".

☆94

Alternatives and similar repositories for CleanMel

Users who are interested in CleanMel are comparing it to the repositories listed below.

Audio-WestlakeU / VINP
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…
☆36 · Feb 23, 2026 · Updated 5 months ago
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
☆35 · Dec 17, 2024 · Updated last year
Audio-WestlakeU / Mel-McNet
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26 · May 14, 2026 · Updated 2 months ago
Audio-WestlakeU / Rec-RIR
Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…
☆34 · Jun 4, 2026 · Updated last month
Audio-WestlakeU / SAR-SSL
A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40 · Oct 11, 2024 · Updated last year
ASLP-lab / SenSE
Official code of SenSE.
☆90 · Oct 30, 2025 · Updated 8 months ago
Audio-WestlakeU / RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175 · Apr 29, 2025 · Updated last year
hyyan2k / PGUSE
This is the official implementation of PGUSE
☆41 · Jun 7, 2025 · Updated last year
Audio-WestlakeU / RCT
This repo gives the code for the official implementation of RCT.
☆13 · Jun 28, 2022 · Updated 4 years ago
Audio-WestlakeU / FS-EEND
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183 · May 7, 2026 · Updated 2 months ago
isHuangZiling / SEF-PNet
☆24 · Jul 10, 2025 · Updated last year
Audio-WestlakeU / NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆362 · Jan 1, 2025 · Updated last year
IiuZiKai / Evo_TSE
☆17 · Apr 9, 2026 · Updated 3 months ago
merlresearch / tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133 · Aug 8, 2025 · Updated 11 months ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
RoyChao19477 / SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
☆274 · Dec 12, 2025 · Updated 7 months ago
hyyan2k / LiSenNet
This is the official implementation of the LiSenNet
☆162 · Nov 15, 2024 · Updated last year
Xiaobin-Rong / ul-unas
The official repo of UL-UNAS, an ultra-lightweight SE model.
☆192 · Jun 17, 2026 · Updated last month
Max1Wz / H-GTCRN
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111 · Mar 13, 2026 · Updated 4 months ago
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
Audio-WestlakeU / McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130 · Mar 24, 2023 · Updated 3 years ago
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
NikolaiKyhne / xLSTM-SENet
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
☆60 · Aug 28, 2025 · Updated 10 months ago
Audio-WestlakeU / pytorch_lightning_template_for_beginners
A PyTorch template for beginners based on pytorch_lightning
☆51 · Feb 1, 2024 · Updated 2 years ago
JusperLee / TFACM
☆23 · Jul 16, 2025 · Updated last year
Xiaobin-Rong / gtcrn
The official implementation of GTCRN, an ultra-lightweight SE model.
☆695 · Jan 18, 2026 · Updated 6 months ago
gitwukeyi / FSPEN
☆59 · Apr 24, 2024 · Updated 2 years ago
huaidanquede / MUSE-Speech-Enhancement
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…
☆58 · Mar 5, 2025 · Updated last year
Honee-W / FlowSE
Official repository for FlowSE (Interspeech 2025)
☆111 · Jul 9, 2025 · Updated last year
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Updated this week
urgent-challenge / urgent2025_challenge
Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.
☆85 · May 21, 2025 · Updated last year
haoxiangsnr / spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆142 · Jan 28, 2026 · Updated 5 months ago
cisco-open / pase
PASE: Phonologically Anchored Speech Enhancer
☆86 · Jul 15, 2026 · Updated last week
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60 · Apr 14, 2025 · Updated last year
WingSingFung / TISDiSS
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16 · Oct 19, 2025 · Updated 9 months ago
yxlu-0102 / MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆493 · May 19, 2025 · Updated last year
aask1357 / fastenhancer
Speed-optimized streaming neural speech enhancement network
☆136 · Jul 3, 2026 · Updated 3 weeks ago
Audio-WestlakeU / FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159 · Mar 10, 2026 · Updated 4 months ago