EmilianPostolache/stable-audio-controlnet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EmilianPostolache/stable-audio-controlnet)

EmilianPostolache / stable-audio-controlnet

Fine-tune Stable Audio Open with DiT ControlNet.

☆256

Alternatives and similar repositories for stable-audio-controlnet

Users that are interested in stable-audio-controlnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyCSLParis / music2latent
View on GitHub
Encode and decode audio samples to/from compressed latent representations!
☆267Sep 19, 2025Updated 10 months ago
yukara-ikemiya / friendly-stable-audio-tools
View on GitHub
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆218Jul 25, 2024Updated last year
NeuralNotW0rk / LoRAW
View on GitHub
Flexible LoRA Implementation to use with stable-audio-tools
☆84Sep 9, 2024Updated last year
NilsDem / control-transfer-diffusion
View on GitHub
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆67Feb 19, 2025Updated last year
sh-lee97 / grafx
View on GitHub
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
☆139Jun 29, 2026Updated 3 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
haidog-yaqub / EzAudio
View on GitHub
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆333Dec 17, 2025Updated 7 months ago
minzwon / musicfm
View on GitHub
☆268Feb 14, 2024Updated 2 years ago
Stability-AI / stable-audio-tools
View on GitHub
Generative models for conditional audio generation
☆3,826Updated this week
roserbatlleroca / mira
View on GitHub
MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…
☆35Nov 14, 2025Updated 8 months ago
yukara-ikemiya / minimal-musicgen-for-developers
View on GitHub
[PyTorch] Minimal codebase for MusicGen models
☆63Jan 7, 2025Updated last year
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
Stability-AI / stable-audio-metrics
View on GitHub
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
☆300Updated this week
gudgud96 / frechet-audio-distance
View on GitHub
A lightweight library for Frechet Audio Distance calculation.
☆317Feb 11, 2026Updated 5 months ago
mulab-mir / song-describer-dataset
View on GitHub
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
☆175Dec 22, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
csteinmetz1 / auraloss
View on GitHub
Collection of audio-focused loss functions in PyTorch
☆874Jul 30, 2024Updated last year
mcomunita / syncfusion
View on GitHub
SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis
☆19Jul 22, 2024Updated 2 years ago
ismir-24-sub / unsupervised_compositional_representations
View on GitHub
ISMIR 24 Supplementary Material
☆14Oct 28, 2024Updated last year
SonyCSLParis / pesto
View on GitHub
Self-supervised learning for real-time pitch estimation
☆297Oct 15, 2025Updated 9 months ago
YoonjinXD / kadtk
View on GitHub
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104Jun 12, 2025Updated last year
ta603 / RefinPaint
View on GitHub
☆12Jul 5, 2024Updated 2 years ago
Pliploop / SLAP
View on GitHub
Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding
☆63Sep 25, 2025Updated 10 months ago
jryban / frechet-music-distance
View on GitHub
A library for computing Frechet Music Distance.
☆31Feb 4, 2025Updated last year
fundwotsai2001 / AP-adapter
View on GitHub
Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]
☆57Nov 10, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
gzhu06 / Cacophony
View on GitHub
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
☆49Jan 19, 2026Updated 6 months ago
ylacombe / musicgen-dreamboothing
View on GitHub
Fine-tune your own MusicGen with LoRA
☆161Apr 26, 2024Updated 2 years ago
SonyCSLParis / Stem-JEPA
View on GitHub
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
☆55Aug 6, 2024Updated last year
sanderwood / melodyt5
View on GitHub
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆50Jan 23, 2025Updated last year
happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194Mar 25, 2024Updated 2 years ago
ldzhangyx / instruct-MusicGen
View on GitHub
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…
☆109Jan 14, 2026Updated 6 months ago
YatingMusic / MusiConGen
View on GitHub
☆88Oct 20, 2024Updated last year
gladia-research-group / multi-source-diffusion-models
View on GitHub
☆171Aug 14, 2023Updated 2 years ago
fundwotsai2001 / MuseControlLite
View on GitHub
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
☆68Jan 6, 2026Updated 6 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
audio-captioning / caption-evaluation-tools
View on GitHub
Tools for the evaluation of audio captioning.
☆19May 23, 2020Updated 6 years ago
MTG / SingWithExpressions
View on GitHub
This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics
☆16Oct 28, 2024Updated last year
LiuZH-19 / SongGen
View on GitHub
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆314Nov 5, 2025Updated 8 months ago
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆121Jan 28, 2026Updated 5 months ago
QwenAudio / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,371May 20, 2025Updated last year
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49Sep 11, 2024Updated last year