sp-uhh / diffphase
DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆14Updated last year
Alternatives and similar repositories for diffphase:
Users that are interested in diffphase are comparing it to the libraries listed below
- ☆61Updated last year
- A neural speech codec based on discrete WavLM representations☆23Updated 8 months ago
- ☆26Updated last year
- ☆25Updated last year
- ☆12Updated last year
- Implementation of SpatialCodec.☆56Updated last year
- ☆13Updated last year
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆54Updated 3 months ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- ☆49Updated 2 years ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 3 months ago
- ☆17Updated 9 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated 3 weeks ago
- SRTNet☆24Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆25Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 9 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 5 months ago
- ☆24Updated last year
- ☆48Updated 3 weeks ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆46Updated 6 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆31Updated 4 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- offical code for Dense-TSNet☆12Updated 7 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆31Updated 7 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆73Updated 3 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆61Updated 3 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- TODO☆38Updated last year