iamycy / diffwave-srLinks
☆83Updated 2 years ago
Alternatives and similar repositories for diffwave-sr
Users that are interested in diffwave-sr are comparing it to the libraries listed below
Sorting:
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆62Updated 2 years ago
- ☆67Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆37Updated 3 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆118Updated 2 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆55Updated 3 years ago
- ☆44Updated last year
- ☆63Updated 2 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆32Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆42Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Updated last year
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆43Updated 3 months ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆25Updated 3 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆43Updated 9 months ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆89Updated last month
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- Project for MIDI to Audio Synthesis☆25Updated 2 years ago
- ☆106Updated 2 weeks ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆52Updated 3 months ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆44Updated 2 years ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆38Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆54Updated last year
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆55Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 5 months ago