Official repository for FlowSE (Interspeech 2025)
☆89Jul 9, 2025Updated 7 months ago
Alternatives and similar repositories for FlowSE
Users that are interested in FlowSE are comparing it to the libraries listed below
Sorting:
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆91Jul 23, 2025Updated 7 months ago
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- ☆32Jan 9, 2024Updated 2 years ago
- ☆38Apr 3, 2025Updated 11 months ago
- An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.☆24Oct 29, 2025Updated 4 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated 2 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆26May 25, 2023Updated 2 years ago
- Brownian Bridge with Exponential Diffusion Coefficient☆44Nov 1, 2023Updated 2 years ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Jul 31, 2024Updated last year
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆89Feb 2, 2026Updated last month
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆75Jun 16, 2025Updated 8 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- A real-time voice conversion model based on VITS.☆14Aug 1, 2024Updated last year
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆87May 26, 2025Updated 9 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 9 months ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆254Sep 13, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"☆10May 8, 2023Updated 2 years ago
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 11 months ago
- ☆20Aug 25, 2025Updated 6 months ago
- ☆11Dec 17, 2025Updated 2 months ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Codebase of the submitted work in ICASSP 2023☆14Nov 30, 2022Updated 3 years ago
- ☆23Aug 4, 2025Updated 6 months ago
- This is the official implementation of reverberant speech to room impulse response estimator☆41Aug 7, 2024Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆42Jul 4, 2025Updated 7 months ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆33Jan 28, 2026Updated last month
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 10 months ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆724Feb 1, 2026Updated last month
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- Official implementation for FlowSep☆70Jan 2, 2025Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆79May 21, 2025Updated 9 months ago
- A GPU accelerated and torch based audio DSP library☆123Feb 23, 2026Updated last week
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆17Aug 1, 2024Updated last year
- Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in S…☆17Nov 13, 2023Updated 2 years ago