Official repository for FlowSE (Interspeech 2025)
☆100Jul 9, 2025Updated 10 months ago
Alternatives and similar repositories for FlowSE
Users that are interested in FlowSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆97Jul 23, 2025Updated 10 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆42Jul 31, 2024Updated last year
- An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.☆25Oct 29, 2025Updated 6 months ago
- ☆32Jan 9, 2024Updated 2 years ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆92Feb 2, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official code of SenSE.☆84Oct 30, 2025Updated 6 months ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆255Sep 13, 2024Updated last year
- A real-time voice conversion model based on VITS.☆17Aug 1, 2024Updated last year
- Brownian Bridge with Exponential Diffusion Coefficient☆44Nov 1, 2023Updated 2 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆79Jun 16, 2025Updated 11 months ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆18May 12, 2025Updated last year
- Official implementation for FlowSep☆75Jan 2, 2025Updated last year
- Official implemtation of UniverSR (ICASSP 2026)☆48Apr 9, 2026Updated last month
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆750May 12, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆93May 26, 2025Updated 11 months ago
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- ☆39Apr 3, 2025Updated last year
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆56Jul 6, 2023Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆482May 19, 2025Updated last year
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆28May 29, 2024Updated last year
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated last year
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆348Jan 1, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation of the paper PitchFlower: A flow-based neural audio codec with pitch controllability☆35Nov 3, 2025Updated 6 months ago
- ☆23Aug 4, 2025Updated 9 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated last year
- This is the official implementation of reverberant speech to room impulse response estimator☆42Aug 7, 2024Updated last year
- A GPU accelerated and torch based audio DSP library☆131May 5, 2026Updated 2 weeks ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- This is the audio sample repository for speech separation model "MossFormer2".☆183Nov 28, 2024Updated last year
- A lightweight audio codec based on a single quantizer☆34Sep 4, 2025Updated 8 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Generation scripts for EARS-WHAM and EARS-Reverb☆44Jul 4, 2025Updated 10 months ago
- Sound field reconstruction using neural processes with dynamic kernels☆16Mar 25, 2025Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆47Mar 10, 2025Updated last year
- ☆12Oct 13, 2022Updated 3 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆79Feb 9, 2026Updated 3 months ago
- ☆22Aug 25, 2025Updated 8 months ago
- ☆65Jun 28, 2023Updated 2 years ago