NVIDIA / diffusion-audio-restoration
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
☆89Updated 3 weeks ago
Alternatives and similar repositories for diffusion-audio-restoration
Users who are interested in diffusion-audio-restoration compare it to the repositories listed below.
- ☆55Updated 7 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆43Updated 3 months ago
- ☆44Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆37Updated 3 months ago
- Fast and accurate fundamental frequency (F0) detector using convolutional neural networks☆66Updated last week
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆69Updated last month
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆52Updated 3 months ago
- The official implementation of TokenSynth (ICASSP 2025)☆74Updated 3 months ago
- Prosody and Pronunciation Modification Network☆56Updated 4 months ago
- Landing Page for All Things Source Separation☆33Updated last month
- ☆28Updated last year
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering☆86Updated 2 weeks ago
- ☆106Updated 2 weeks ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆42Updated 2 years ago
- Zero-Shot Blind Audio Bandwidth Extension☆25Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆75Updated 7 months ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆55Updated 2 months ago
- ☆67Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆62Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆82Updated 3 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆41Updated 4 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆41Updated last month
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆32Updated 6 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 5 months ago
- Unofficial implementation of NANSY++ in PyTorch Lightning☆50Updated last year
- Separate Anything in Audio with Zero Training☆40Updated 3 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks☆42Updated 3 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35Updated 2 years ago
- PyTorch implementation of the invertible CQT based on non-stationary Gabor filters☆32Updated 2 years ago