sp-uhh / diffphase
DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆16 Updated last year
Alternatives and similar repositories for diffphase
Users who are interested in diffphase are comparing it to the repositories listed below.
- ☆31 Updated last year
- ☆54 Updated 2 years ago
- ☆63 Updated last year
- ☆28 Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆21 Updated last month
- ☆13 Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆27 Updated last year
- ☆25 Updated 2 years ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆55 Updated last month
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024) ☆79 Updated last week
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆36 Updated 2 months ago
- ☆35 Updated 2 years ago
- ☆26 Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆41 Updated 8 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks ☆35 Updated 2 years ago
- Official implementation of Self-Remixing ☆15 Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆45 Updated 2 months ago
- ☆48 Updated 4 months ago
- Implementation of SpatialCodec. ☆59 Updated last year
- Source code of APNet2, a vocoder ☆55 Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb ☆36 Updated last month
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra" ☆61 Updated 2 years ago
- A lightweight audio codec based on a single quantizer ☆66 Updated 4 months ago
- ☆36 Updated last month
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… ☆21 Updated last year
- A neural speech codec based on discrete WavLM representations ☆24 Updated 11 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆12 Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆30 Updated last year
- Spherical residual vector quantization (SRVQ) ☆30 Updated 11 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters ☆31 Updated 2 years ago