Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
☆145Dec 12, 2021Updated 4 years ago
Alternatives and similar repositories for MelSpecVAE
Users that are interested in MelSpecVAE are comparing it to the libraries listed below
Sorting:
- ☆14Sep 21, 2022Updated 3 years ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆119Dec 13, 2021Updated 4 years ago
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆21Dec 3, 2021Updated 4 years ago
- Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder☆1,690Jun 23, 2025Updated 8 months ago
- VST/AU Plugin for Auditioning RAVE Models in Real-time☆86Mar 25, 2022Updated 3 years ago
- "Neural Loop Combiner: Neural Network Models For Assessing The Compatibility of Loops", ISMIR 2020☆33Nov 8, 2020Updated 5 years ago
- Generate new latent codes for RAVE with Denoising Diffusion models.☆182Dec 2, 2024Updated last year
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆11Nov 25, 2021Updated 4 years ago
- Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch☆506Oct 28, 2023Updated 2 years ago
- efficient neural audio synthesis in the waveform domain☆191Apr 14, 2025Updated 10 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.☆366Feb 16, 2026Updated 2 weeks ago
- MAX/MSP objects for audio and rhythmic synthesis using networks of coupled oscillators☆13May 5, 2023Updated 2 years ago
- ☆193Jun 21, 2023Updated 2 years ago
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- fork of the RAVE model: a Realtime Audio Variational autoEncoder☆47Nov 22, 2025Updated 3 months ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- ☆15May 9, 2024Updated last year
- Neural network bending framework for creativity in Pytorch☆36Dec 5, 2024Updated last year
- Based on Neural Amp Modeler 0.7.1 with some enhanced features☆12Apr 18, 2023Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch☆854Jul 30, 2024Updated last year
- A PyTorch implementation of the musicnn model for music audio tagging☆38Jul 25, 2024Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Simple Python CLI script for downloading N-hours of audio from Youtube, based on a list of music genres.☆33Dec 13, 2023Updated 2 years ago
- Synthesis of Drum Sounds With Perceptual Timbral Conditioning Using Generative Adversarial Networks☆125Mar 9, 2023Updated 2 years ago
- ☆10Oct 9, 2025Updated 4 months ago
- A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API☆30Dec 30, 2021Updated 4 years ago
- ☆15May 8, 2021Updated 4 years ago
- ☆402Jul 8, 2025Updated 7 months ago
- 4 Hour cuSignal Tutorial - ICASSP 2021 Notebooks☆49Jun 7, 2021Updated 4 years ago
- Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)☆333Nov 30, 2022Updated 3 years ago
- Official repo of ISMIR-21 publication, “A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset”.☆83Nov 17, 2021Updated 4 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Differentiable FM Synthesis of Musical Instrument Sounds☆144Aug 15, 2022Updated 3 years ago
- Max for Live(M4L) Rhythm generator using Variational Autoencoder(VAE)☆243Dec 16, 2025Updated 2 months ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- ☆87May 21, 2023Updated 2 years ago