Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆63Feb 19, 2025Updated last year
Alternatives and similar repositories for control-transfer-diffusion
Users that are interested in control-transfer-diffusion are comparing it to the libraries listed below
Sorting:
- AFTER : Audio Features Transfer and Exploration in Real-time☆109Sep 8, 2025Updated 6 months ago
- The official implementation of TokenSynth (ICASSP 2025)☆81Oct 27, 2025Updated 4 months ago
- This is the official repository of PLaTune, our Pretrained Latents Tuner model that enables to add temporal musical controls on top of pr…☆17Jun 28, 2025Updated 8 months ago
- ☆55Nov 5, 2024Updated last year
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆40Jan 17, 2026Updated 2 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆249May 16, 2025Updated 10 months ago
- Encode and decode audio samples to/from compressed latent representations!☆250Sep 19, 2025Updated 6 months ago
- Front-end for symbolic music AI models☆17Nov 20, 2025Updated 4 months ago
- ☆117Feb 26, 2026Updated 3 weeks ago
- Official implementation of WildFX Dataset Generating pipeline.☆15Oct 21, 2025Updated 5 months ago
- ☆18Nov 8, 2024Updated last year
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 10 months ago
- ☆20May 7, 2025Updated 10 months ago
- ☆47Nov 13, 2021Updated 4 years ago
- ☆87Jan 29, 2023Updated 3 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆49Jan 19, 2026Updated 2 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆35Sep 11, 2025Updated 6 months ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆99Feb 24, 2025Updated last year
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆47May 24, 2025Updated 9 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆58Nov 10, 2025Updated 4 months ago
- Official source codes of coco-mulla☆36Mar 21, 2024Updated 2 years ago
- Encode and decode audio samples to/from continuous and discrete compressed representations!☆106Nov 25, 2025Updated 3 months ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated last year
- ☆30Mar 19, 2025Updated last year
- ☆17Sep 2, 2025Updated 6 months ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆136Feb 3, 2025Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- Abstractions for Latent Jamming with nn~ compatible neural audio models written in Pure Data☆19Mar 1, 2026Updated 2 weeks ago
- Ravetable synthesis - Latent signal processing☆37Sep 25, 2025Updated 5 months ago
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆48Sep 11, 2024Updated last year
- PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing☆110Jun 1, 2025Updated 9 months ago
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated 11 months ago
- applying audio FX with text descriptors☆33Nov 12, 2025Updated 4 months ago
- Multitrack music mixing style transfer given a reference song using differentiable mixing console.☆58Jul 7, 2025Updated 8 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆229May 11, 2025Updated 10 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆78Jun 19, 2025Updated 9 months ago
- ☆14Sep 21, 2022Updated 3 years ago