Harmonai-org / oobleck
open soundstream-ish VAE codecs for downstream neural audio synthesis
☆113 Updated last year
Related projects
Alternatives and complementary repositories for oobleck
- Generate new latent codes for RAVE with Denoising Diffusion models. ☆163 Updated 11 months ago
- ☆81 Updated last year
- Trainer for audio-diffusion-pytorch ☆127 Updated last year
- Encode and decode audio samples to/from compressed latent representations! ☆150 Updated 3 months ago
- A collection of useful audio datasets and transforms for PyTorch. ☆134 Updated last year
- ☆74 Updated last year
- Models and datasets for training deep learning automatic mixing models ☆95 Updated 2 months ago
- Self-supervised learning for fast pitch estimation ☆191 Updated last month
- a notebook containing scripts, documentation, and examples for finetuning musicgen ☆75 Updated 7 months ago
- (ML) audio engineering i/o utils ☆54 Updated 10 months ago
- Code for Investigating Personalization Methods in Text to Music Generation ☆35 Updated 7 months ago
- ☆156 Updated last year
- Code for Chamber Ensemble Generator pipeline and CocoChorales Dataset ☆58 Updated 8 months ago
- Anticipatory Autoregressive Models ☆151 Updated last month
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to Differentiable Audio Synthesiser Programming ☆55 Updated 8 months ago
- ☆46 Updated 3 years ago
- Audio generation using diffusion models, in PyTorch. ☆46 Updated last year
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls ☆74 Updated 4 months ago
- ☆11 Updated 2 years ago
- Unofficial download repository for MusicCaps ☆44 Updated last year
- Results and Models for Learning Audio Representations of Music Content ☆93 Updated 5 months ago
- Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis ☆131 Updated 2 years ago
- Official Implementation of "Multitrack Music Transformer" (ICASSP 2023) ☆138 Updated 8 months ago
- alchemy with embeddings ☆34 Updated last year
- Headless multitrack mixing console in Python ☆116 Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation ☆147 Updated this week
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper. ☆37 Updated 4 months ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch ☆93 Updated last month
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆112 Updated last year
- General Purpose Audio Effect Removal ☆95 Updated last year