NilsDem/control-transfer-diffusion

Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024

☆67

Alternatives and similar repositories for control-transfer-diffusion

Users who are interested in control-transfer-diffusion are comparing it to the repositories listed below.

acids-ircam / AFTER
AFTER : Audio Features Transfer and Exploration in Real-time
☆134 · May 16, 2026 · Updated 2 months ago
acids-ircam / platune
This is the official repository of PLaTune, our Pretrained Latents Tuner model that enables to add temporal musical controls on top of pr…
☆18 · Jun 28, 2025 · Updated last year
KyungsuKim42 / tokensynth
The official implementation of TokenSynth (ICASSP 2025)
☆91 · Jun 24, 2026 · Updated 3 weeks ago
SonyCSLParis / codicodec
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121 · Nov 25, 2025 · Updated 7 months ago
sony / diffusion-timbre-transfer
☆56 · Nov 5, 2024 · Updated last year
fcaspe / BRAVE
Low-latency timbre transfer models for instrumental interaction.
☆106 · Oct 10, 2025 · Updated 9 months ago
SonyCSLParis / music2latent
Encode and decode audio samples to/from compressed latent representations!
☆267 · Sep 19, 2025 · Updated 10 months ago
devstermarts / PD-Latent-Jamming
Abstractions for Latent Jamming with nn~ compatible neural audio models written in Pure Data
☆21 · Jul 9, 2026 · Updated last week
EmilianPostolache / stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
☆256 · May 16, 2025 · Updated last year
acids-ircam / ravetable
Ravetable synthesis - Latent signal processing
☆38 · Sep 25, 2025 · Updated 9 months ago
HSUNEH / DOSE
☆19 · Sep 22, 2025 · Updated 9 months ago
koichi-saito-sony / ismir2024_tutorial_demo
☆18 · Nov 8, 2024 · Updated last year
csteinmetz1 / st-ito
Audio production style transfer with inference-time optimization
☆57 · Updated this week
fundwotsai2001 / AP-adapter
Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]
☆57 · Nov 10, 2025 · Updated 8 months ago
spluta / RTNeural_Plugin
☆17 · Sep 2, 2025 · Updated 10 months ago
geoffroypeeters / ssmnet_ISMIR2023
☆20 · Oct 20, 2023 · Updated 2 years ago
zhaojw1998 / Structured-Arrangement-Code
Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.
☆43 · Jan 17, 2026 · Updated 6 months ago
gzhu06 / Cacophony
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
☆49 · Jan 19, 2026 · Updated 6 months ago
qiuqiangkong / audioflow
☆128 · Updated this week
yoyolicoris / music-spectrogram-diffusion-pytorch
☆88 · Jan 29, 2023 · Updated 3 years ago
YoonjinXD / kadtk
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104 · Jun 12, 2025 · Updated last year
yongyizang / SynthTab
Official Repository for ICASSP 2024 Paper "SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription"
☆33 · Dec 6, 2024 · Updated last year
yukara-ikemiya / friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆220 · Jul 25, 2024 · Updated last year
NVIDIA / diffusion-audio-restoration
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
☆145 · Aug 13, 2025 · Updated 11 months ago
TeeJayBaker / PolyDDSP
Polyphonic generalisation of DDSP
☆22 · Apr 30, 2024 · Updated 2 years ago
hyakuchiki / diffsynth
☆48 · Nov 13, 2021 · Updated 4 years ago
AMAAI-Lab / JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53 · May 24, 2025 · Updated last year
csteinmetz1 / dasp-pytorch
Differentiable audio signal processors in PyTorch
☆297 · Dec 4, 2023 · Updated 2 years ago
sh-lee97 / grafx
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
☆139 · Jun 29, 2026 · Updated 3 weeks ago
SonyCSLParis / pesto
Self-supervised learning for real-time pitch estimation
☆297 · Oct 15, 2025 · Updated 9 months ago
MIDIAI / MuseCraft
Front-end for symbolic music AI models
☆17 · Nov 20, 2025 · Updated 8 months ago
bernardo-torres / linear-autoencoders
Official code and pretrained models for Linear Consistency Autoencoders (Lin-CAE), a method to induce linearity in audio autoencoders via…
☆17 · Feb 12, 2026 · Updated 5 months ago
jasper-zheng / music2latent-scripted
Scripting Music2Latent to TorchScript for streamable continuous inference in MaxMSP/PureData
☆22 · Feb 5, 2026 · Updated 5 months ago
SonyResearch / Fx-Encoder_PlusPlus
"Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"
☆52 · Aug 23, 2025 · Updated 10 months ago
tamlablinz / RAVE_PCA
Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration
☆21 · Jul 15, 2025 · Updated last year
sony / sampleid
Code for the paper “Automatic Music Sample Identification with Multi-Track Contrastive Learning”.
☆25 · May 22, 2026 · Updated last month
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41 · Jan 6, 2024 · Updated 2 years ago
hugofloresgarcia / vampnet
music generation with masked transformers!
☆357 · May 16, 2025 · Updated last year
weAreMusicAI / dmx-diffusion
☆15 · Oct 13, 2025 · Updated 9 months ago