DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆16Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for diffphase
Users that are interested in diffphase are comparing it to the libraries listed below
Sorting:
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35May 25, 2023Updated 2 years ago
- ☆16Sep 19, 2023Updated 2 years ago
- ☆87May 21, 2023Updated 2 years ago
- ☆54Mar 2, 2023Updated 2 years ago
- ☆29Mar 28, 2024Updated last year
- ☆67Aug 16, 2023Updated 2 years ago
- phase reconstruction from magnitude terms of an STFT☆13May 18, 2025Updated 9 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆17Sep 13, 2024Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆111Aug 29, 2024Updated last year
- ☆38Feb 1, 2024Updated 2 years ago
- ☆18Mar 10, 2023Updated 2 years ago
- A list of publications that have accompanying open-source code☆22Oct 30, 2023Updated 2 years ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Jul 31, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- DeepLearningで音楽をアップサンプリングします☆19Mar 24, 2018Updated 7 years ago
- ☆25Oct 4, 2022Updated 3 years ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆108Jan 17, 2025Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆79Jun 8, 2025Updated 8 months ago
- ☆32Apr 22, 2024Updated last year
- code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆29Apr 12, 2024Updated last year
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆237May 1, 2025Updated 9 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- ☆30Jul 18, 2024Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆69Dec 9, 2022Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- ☆70Jan 25, 2025Updated last year
- Fast Independent Vector Extraction: Code and data to reproduce the results from the paper.☆24May 7, 2020Updated 5 years ago
- ☆59Jun 14, 2024Updated last year
- Performance-oriented implementation of independent vector analysis for blind source separation.☆26Mar 26, 2020Updated 5 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated 11 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆30Jun 17, 2025Updated 8 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated 10 months ago
- ☆38Jun 5, 2023Updated 2 years ago