Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆66Feb 19, 2025Updated last year
Alternatives and similar repositories for control-transfer-diffusion
Users that are interested in control-transfer-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AFTER : Audio Features Transfer and Exploration in Real-time☆126May 16, 2026Updated 3 weeks ago
- The official implementation of TokenSynth (ICASSP 2025)☆90Oct 27, 2025Updated 7 months ago
- This is the official repository of PLaTune, our Pretrained Latents Tuner model that enables to add temporal musical controls on top of pr…☆18Jun 28, 2025Updated 11 months ago
- ☆56Nov 5, 2024Updated last year
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆43Jan 17, 2026Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Fine-tune Stable Audio Open with DiT ControlNet.☆252May 16, 2025Updated last year
- Encode and decode audio samples to/from compressed latent representations!☆258Sep 19, 2025Updated 8 months ago
- ☆121Jun 2, 2026Updated last week
- Front-end for symbolic music AI models☆17Nov 20, 2025Updated 6 months ago
- Encode and decode audio samples to/from continuous and discrete compressed representations!☆114Nov 25, 2025Updated 6 months ago
- ☆18Nov 8, 2024Updated last year
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated last year
- ☆88Jan 29, 2023Updated 3 years ago
- ☆48Nov 13, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆20May 7, 2025Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆49Jan 19, 2026Updated 4 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆37Sep 11, 2025Updated 8 months ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆99Feb 24, 2025Updated last year
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆57Nov 10, 2025Updated 7 months ago
- Official source codes of coco-mulla☆36Mar 21, 2024Updated 2 years ago
- Official implementation of WildFX Dataset Generating pipeline.☆18Oct 21, 2025Updated 7 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆51May 24, 2025Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆31May 22, 2026Updated 2 weeks ago
- ☆17Sep 2, 2025Updated 9 months ago
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆137Feb 3, 2025Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆41Jan 6, 2024Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- Abstractions for Latent Jamming with nn~ compatible neural audio models written in Pure Data☆19May 6, 2026Updated last month
- Ravetable synthesis - Latent signal processing☆38Sep 25, 2025Updated 8 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆49Sep 11, 2024Updated last year
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch implementation of SoundCTM☆101Mar 31, 2025Updated last year
- PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing☆128Jun 1, 2025Updated last year
- ☆14Sep 21, 2022Updated 3 years ago
- XMIDI Dataset: A large-scale symbolic music dataset with emotion and genre labels.☆38Jan 16, 2025Updated last year
- ☆401Jan 16, 2025Updated last year
- Multitrack music mixing style transfer given a reference song using differentiable mixing console.☆64Jul 7, 2025Updated 11 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆244May 11, 2025Updated last year