iamycy / diffwave-sr
☆79Updated last year
Related projects ⓘ
Alternatives and complementary repositories for diffwave-sr
- ☆61Updated 7 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆30Updated 10 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated 3 weeks ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆54Updated last year
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆52Updated 2 years ago
- Stable Audio UnOffical Implementation: Latent Diffusion for Audio Generation☆23Updated 8 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- ☆59Updated last year
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆50Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆57Updated last year
- ☆47Updated 4 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆37Updated last year
- Unofficial implementation of NANSY++ in Pytorch Lightning☆48Updated 8 months ago
- ☆34Updated 4 months ago
- ☆40Updated 5 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆78Updated 7 months ago
- code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆20Updated 7 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆34Updated 3 weeks ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Audio production style transfer with inference-time optimization☆16Updated 4 months ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- ☆14Updated 2 years ago
- ☆49Updated last year
- ☆87Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆25Updated 5 months ago