sp-uhh / diffphase
DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆16Updated 2 years ago
Alternatives and similar repositories for diffphase
Users who are interested in diffphase are comparing it to the repositories listed below
- ☆65Updated 2 years ago
- ☆34Updated last year
- ☆54Updated 2 years ago
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement☆68Updated 2 months ago
- ☆28Updated last year
- ☆16Updated 2 years ago
- ☆13Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆44Updated 11 months ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated this week
- ☆29Updated last year
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆92Updated 2 months ago
- ☆35Updated 2 years ago
- ☆25Updated 3 years ago
- ☆84Updated 2 years ago
- ☆49Updated 6 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆31Updated last year
- Official implementation of Self-Remixing☆16Updated last year
- Implementation of SpatialCodec.☆62Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- Source code of APNet2, a vocoder☆55Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆38Updated 3 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Updated last week
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Updated 2 years ago
- This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which is submitted to TPAMI.☆27Updated last week
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Updated last year
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆62Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆76Updated 4 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆38Updated 5 months ago
- ☆27Updated last year
- ☆30Updated last year