DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆16Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for diffphase
Users that are interested in diffphase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆36May 25, 2023Updated 3 years ago
- ☆87May 21, 2023Updated 3 years ago
- ☆17Sep 19, 2023Updated 2 years ago
- DeepLearningで音楽をアップサンプリングします☆20Mar 24, 2018Updated 8 years ago
- ☆55Mar 2, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- phase reconstruction from magnitude terms of an STFT☆13May 18, 2025Updated last year
- ☆68Aug 16, 2023Updated 2 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 3 years ago
- The official PyTorch implementation of VM-ASR, a model designed for high-fidelity audio super-resolution.☆23Sep 8, 2025Updated 9 months ago
- ☆19Mar 10, 2023Updated 3 years ago
- ☆28Mar 28, 2024Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆114Aug 29, 2024Updated last year
- This is official repository of new SOTA diffusion models based method for speech enhancement☆42Jul 31, 2024Updated last year
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆243May 1, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆72Dec 9, 2022Updated 3 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆113Jan 17, 2025Updated last year
- Working repository for the MUSCIMA++ dataset☆13May 16, 2021Updated 5 years ago
- code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆30Apr 12, 2024Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆44Jun 13, 2024Updated last year
- ☆41Feb 1, 2024Updated 2 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated last year
- ☆59Jun 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch implementation of the paper A Repetition-based Triplet Mining Approach for Music Segmentation presented at ISMIR 2023.☆13Nov 9, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆19Sep 13, 2024Updated last year
- ☆38Jun 5, 2023Updated 3 years ago
- ☆81Jan 22, 2025Updated last year
- ☆15Nov 26, 2024Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆90Jun 8, 2025Updated last year
- A list of publications that have accompanying open-source code☆22Oct 30, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆59Nov 23, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆25Oct 4, 2022Updated 3 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 7 months ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆23Oct 10, 2025Updated 7 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆30Jun 17, 2025Updated 11 months ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆29Nov 18, 2025Updated 6 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 7 months ago