JacobLinCool/MPSENet

JacobLinCool / MPSENet

Python package of MP-SENet, from the paper "Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement".

☆21

Alternatives and similar repositories for MPSENet

Users who are interested in MPSENet are comparing it to the libraries listed below.

llm-lab-org / CLASP
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13 · Jun 27, 2025 · Updated last year
ina-foss / InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
☆16 · Jan 20, 2025 · Updated last year
ZygoteCode / VadSharp
Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, …
☆10 · Apr 20, 2025 · Updated last year
seongq / flowmse
(ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement
☆106 · Jul 23, 2025 · Updated 11 months ago
sp-uhh / 2sderev
Two-stage Dereverberation Algorithm using DNN-supported multi-channel linear filtering and single-channel non-linear post-filtering
☆15 · Jan 10, 2025 · Updated last year
pilot7747 / VoxDIY
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16 · Jul 22, 2021 · Updated 4 years ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
XapaJIaMnu / gLM
A GPU language model, based on btree backed tries.
☆30 · Mar 6, 2018 · Updated 8 years ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17 · May 16, 2025 · Updated last year
JaesungHuh / av-diarization
Audio-visual diarization pipeline used for creating VoxConverse dataset
☆22 · Jun 6, 2025 · Updated last year
ICDM-UESTC / COSE
The implementation of Paper: Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement.
☆16 · Sep 23, 2025 · Updated 9 months ago
yxlu-0102 / MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆493 · May 19, 2025 · Updated last year
seongq / cascadingtwoflowmatching
(Interspeech 2025, official code) Speech enhancement based on cascaded two flows
☆16 · Jun 18, 2026 · Updated last month
noisereduce / TorchSpectralGating
TorchSpectralGate is a PyTorch-based implementation of Spectral Gating, an algorithm for denoising audio signals.
☆27 · Feb 3, 2024 · Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
yoongi43 / music_audio_enhancement_conformer
Implementation of the paper "Exploiting Time-Frequency Conformers for Music Audio Enhancement"
☆14 · Mar 21, 2025 · Updated last year
ooshyun / Speech-Enhancement-Pytorch
PyTorch Models for Speech Enhancement
☆23 · Mar 31, 2023 · Updated 3 years ago
cuhealthybrains / MT-LLM
The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"
☆50 · Apr 7, 2025 · Updated last year
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 · Mar 21, 2023 · Updated 3 years ago
codebyzeb / g2p-plus
Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories
☆19 · Apr 10, 2025 · Updated last year
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13 · Dec 4, 2024 · Updated last year
kts707 / real-time-audio-denoiser
A CNN-based audio denoiser
☆10 · May 2, 2021 · Updated 5 years ago
sa-if / Audio-Denoiser
Python based audio denoiser 🔉
☆30 · Jun 4, 2023 · Updated 3 years ago
manyeyes / K2TransducerAsr
C# library for decoding K2 transducer models, used in speech recognition (ASR)
☆13 · Aug 20, 2025 · Updated 11 months ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25 · Oct 8, 2025 · Updated 9 months ago
nathalisr / GCC-PHAT
A study about the Generalized Cross-Correlation with Phase Transform algorithm.
☆14 · Nov 23, 2021 · Updated 4 years ago
eagomez2 / upf-smc-speech-enhancement-thesis
Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario
☆27 · Jan 25, 2024 · Updated 2 years ago
Twinkzzzzz / MeanSE
Official implementation of 'MeanSE: Efficient Generative Speech Enhancement with Mean Flows'
☆19 · Oct 11, 2025 · Updated 9 months ago
manyeyes / AliCTTransformerPunc
C# library for decoding CTTransformer punc models, which can add punctuation to Chinese and English texts
☆14 · Aug 18, 2025 · Updated 11 months ago
avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13 · Mar 9, 2022 · Updated 4 years ago
jhauret / vibravox
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
☆50 · Dec 1, 2025 · Updated 7 months ago
nobmaste / QH_Learning_Resources
☆12 · Aug 15, 2025 · Updated 11 months ago
AaltoAcousticsLab / aalto-datasets
A list of datasets made available by members of the Aalto Acoustics Lab
☆31 · Sep 6, 2024 · Updated last year
manyeyes / KaldiNativeFbankSharp
C# wrapper for kaldi-native-fbank, used to extract audio features in speech recognition (ASR) tasks
☆10 · Jul 26, 2025 · Updated 11 months ago
apple / ml-acn-embed
Acoustic Neighbor Embeddings
☆33 · Jul 13, 2025 · Updated last year
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 · Feb 6, 2025 · Updated last year
Vanka0051 / speech_enhancement
speech enhancement using DNN: [1] Xu, Y., Du, J., Dai, L.R. and Lee, C.H., 2015. A regression approach to speech enhancement based on dee…
☆14 · Sep 17, 2019 · Updated 6 years ago
TEDddr / Adap-WTD
Adaptive wavelet threshold denoising
☆14 · Aug 11, 2023 · Updated 2 years ago
vtuber-plan / hifi-gan
A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32 · Apr 10, 2023 · Updated 3 years ago