Audio-WestlakeU / Mel-McNetLinks
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆17Updated 4 months ago
Alternatives and similar repositories for Mel-McNet
Users that are interested in Mel-McNet are comparing it to the libraries listed below
Sorting:
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆15Updated 5 months ago
- ☆19Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆33Updated 10 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 8 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- ☆51Updated last year
- ☆24Updated 2 years ago
- Neural network density models for speech separation.☆20Updated 4 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆41Updated 3 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Updated last year
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated last week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆35Updated last year
- A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆21Updated 2 weeks ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Updated 3 months ago
- ☆13Updated 2 months ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Updated 2 years ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆42Updated last year
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Updated 4 years ago
- ☆51Updated last year
- ☆11Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆47Updated 7 months ago
- Official code of SenSE.☆53Updated last week
- Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…☆16Updated 3 years ago
- ☆35Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆38Updated last year
- ☆25Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆36Updated this week
- This is the unofficial implementation of MFNet, from paper''a Mask Free Neural Network for Monaural Speech Enhancement''☆13Updated 10 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago