Audio-WestlakeU/Mel-McNet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Audio-WestlakeU/Mel-McNet)

Audio-WestlakeU / Mel-McNet

The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]

☆26

Alternatives and similar repositories for Mel-McNet

Users that are interested in Mel-McNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Audio-WestlakeU / UMA-ASR
View on GitHub
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆35Dec 17, 2024Updated last year
Audio-WestlakeU / Rec-RIR
View on GitHub
Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…
☆34Jun 4, 2026Updated last month
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
zx1292982431 / TF-SkiMNet
View on GitHub
☆26Feb 8, 2025Updated last year
yhsong06 / LAU-Net
View on GitHub
☆16May 23, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
Audio-WestlakeU / VINP
View on GitHub
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…
☆36Feb 23, 2026Updated 5 months ago
Max1Wz / H-GTCRN
View on GitHub
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111Mar 13, 2026Updated 4 months ago
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
zx1292982431 / Loud-Loss
View on GitHub
☆23Sep 16, 2025Updated 10 months ago
Jokejiangv / LABNet
View on GitHub
The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…
☆49Oct 10, 2025Updated 9 months ago
gitwukeyi / FSPEN
View on GitHub
☆59Apr 24, 2024Updated 2 years ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Jul 20, 2026Updated last week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
narrietal / Fast-ULCNet
View on GitHub
Official repository of Fast-ULCNet.
☆39Jun 17, 2026Updated last month
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
Shybert-AI / AEC-Two-Stage-Based
View on GitHub
基于两阶段的声学回声消除系统 A Two-Stage-Based Acoustic Echo Cancellation System
☆17Feb 22, 2026Updated 5 months ago
ZhongshuHou / LSA
View on GitHub
Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)
☆28Sep 16, 2023Updated 2 years ago
Audio-WestlakeU / McNet
View on GitHub
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130Mar 24, 2023Updated 3 years ago
hyyan2k / LiSenNet
View on GitHub
This is the official implementation of the LiSenNet
☆163Nov 15, 2024Updated last year
kooBH / DSS
View on GitHub
[WIP]Direction based Multi-Channel Speech Separation
☆14Jan 25, 2024Updated 2 years ago
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
sp-uhh / ears_benchmark
View on GitHub
Generation scripts for EARS-WHAM and EARS-Reverb
☆48Jul 4, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tencent-ailab / UltraDualPathCompression
View on GitHub
A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation An…
☆66Feb 20, 2024Updated 2 years ago
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆41Jun 7, 2025Updated last year
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
liutaocode / AwesomeDiarizationDataset
View on GitHub
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16Feb 22, 2023Updated 3 years ago
aask1357 / fastenhancer
View on GitHub
Speed-optimized streaming neural speech enhancement network
☆137Jul 3, 2026Updated 3 weeks ago
deepnetni / msa-dpcrn
View on GitHub
☆21Jul 18, 2026Updated last week
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
Audio-WestlakeU / NBSS
View on GitHub
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆363Jan 1, 2025Updated last year
cpystan / PSM
View on GitHub
Exploring Unsupervised Cell Recognition with Prior Self-activation Maps (MICCAI 2023)
☆13Oct 27, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ZLiNJU / AFC-SPEX
View on GitHub
Source code and audio samples for AFC-SPEX, an algorithm that can jointly perform acoustic feedback cancellation and speaker extraction.
☆40Nov 7, 2025Updated 8 months ago
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
kooBH / ULCNet
View on GitHub
[WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).
☆29May 29, 2024Updated 2 years ago
Clovermax / AED-TSVAD
View on GitHub
Attention-Based Encoder-Decoder Target-Speaker Voice Activity Detection for Robust Speaker Diarization
☆31Sep 22, 2025Updated 10 months ago
wangtianrui / APC-SNR
View on GitHub
Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch
☆28Jan 31, 2022Updated 4 years ago
JusperLee / TFACM
View on GitHub
☆24Jul 16, 2025Updated last year
NikolaiKyhne / xLSTM-SENet
View on GitHub
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
☆60Aug 28, 2025Updated 11 months ago