Harmonai-org / oobleck
open soundstream-ish VAE codecs for downstream neural audio synthesis
☆109 · Updated last year
Related projects:
- Code for Chamber Ensemble Generator pipeline and CocoChorales Dataset ☆57 · Updated 6 months ago
- Generate new latent codes for RAVE with Denoising Diffusion models. ☆158 · Updated 9 months ago
- ☆80 · Updated last year
- Trainer for audio-diffusion-pytorch ☆127 · Updated last year
- ☆154 · Updated 10 months ago
- (ML) audio engineering i/o utils ☆52 · Updated 8 months ago
- a notebook containing scripts, documentation, and examples for finetuning musicgen ☆74 · Updated 5 months ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA. ☆119 · Updated 2 years ago
- A collection of useful audio datasets and transforms for PyTorch. ☆130 · Updated last year
- Self-supervised learning for fast pitch estimation ☆171 · Updated last month
- Encode and decode audio samples to/from compressed latent representations! ☆119 · Updated last month
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls ☆71 · Updated 2 months ago
- Headless multitrack mixing console in Python ☆114 · Updated last year
- A collection of pre-trained audio models, in PyTorch. ☆109 · Updated last year
- Anticipatory Autoregressive Models ☆143 · Updated last week
- Models and datasets for training deep learning automatic mixing models ☆92 · Updated 3 weeks ago
- ☆71 · Updated last year
- General Purpose Audio Effect Removal ☆92 · Updated last year
- ☆44 · Updated 3 years ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆37 · Updated last week
- This is a cog implementation of the fine-tuner for Meta's MusicGen ☆46 · Updated 5 months ago
- alchemy with embeddings ☆33 · Updated last year
- Song Describer is a data collection platform for annotating music with textual descriptions. ☆57 · Updated 3 months ago
- A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB … ☆149 · Updated 3 months ago
- ☆145 · Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆194 · Updated last year
- Official Implementation of "Multitrack Music Transformer" (ICASSP 2023) ☆133 · Updated 6 months ago
- PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model ☆69 · Updated 9 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆111 · Updated last year
- ☆46 · Updated last year