sizhelee/Diff-BGM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sizhelee/Diff-BGM)

sizhelee / Diff-BGM

official code for CVPR'24 paper Diff-BGM

☆71

Alternatives and similar repositories for Diff-BGM

Users that are interested in Diff-BGM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhuole1025 / SymMV
View on GitHub
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
☆78Mar 29, 2024Updated 2 years ago
ZeyueT / VidMuse
View on GitHub
[CVPR 2025] Repository of VidMuse
☆140Jun 7, 2025Updated last year
schowdhury671 / melfusion
View on GitHub
☆58Oct 10, 2024Updated last year
TiffanyBlews / MozartsTouch
View on GitHub
Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
☆43Mar 17, 2026Updated 4 months ago
zxxwxyyy / sonique
View on GitHub
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33Oct 8, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chouliuzuo / GVMGen
View on GitHub
☆32Nov 10, 2025Updated 8 months ago
AMAAI-Lab / mustango
View on GitHub
Mustango: Toward Controllable Text-to-Music Generation
☆394Jun 2, 2025Updated last year
wbs2788 / MTM
View on GitHub
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…
☆28Jan 21, 2025Updated last year
Kikyo-16 / airgen
View on GitHub
Official source codes of airsep
☆39Mar 26, 2024Updated 2 years ago
sander-wood / deepchoir
View on GitHub
Chord-Conditioned Melody Harmonization with Controllable Harmonicity [ICASSP 2023]
☆49Jul 15, 2023Updated 3 years ago
aik2mlj / polyffusion
View on GitHub
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
☆89Jul 16, 2024Updated 2 years ago
justivanr / art2mus_
View on GitHub
Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…
☆20Oct 20, 2025Updated 9 months ago
lingyu123-su / Amadeus
View on GitHub
To make music production easier, we introduce Amadeus , a novel MIDI generation framework. While significantly improving generation quali…
☆16Aug 29, 2025Updated 10 months ago
luosiallen / Diff-Foley
View on GitHub
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
☆205May 29, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49Sep 11, 2024Updated last year
wzk1015 / video-bgm-generation
View on GitHub
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
☆326Jun 8, 2025Updated last year
happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194Mar 25, 2024Updated 2 years ago
YatingMusic / MusiConGen
View on GitHub
☆88Oct 20, 2024Updated last year
jryban / frechet-music-distance
View on GitHub
A library for computing Frechet Music Distance.
☆31Feb 4, 2025Updated last year
liuhuadai / AudioLCM
View on GitHub
PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.
☆13Jun 15, 2024Updated 2 years ago
Text-to-Audio / Make-An-Audio-3
View on GitHub
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
☆121May 19, 2025Updated last year
ylacombe / musicgen-dreamboothing
View on GitHub
Fine-tune your own MusicGen with LoRA
☆161Apr 26, 2024Updated 2 years ago
Kikyo-16 / coco-mulla-repo
View on GitHub
Official source codes of coco-mulla
☆36Mar 21, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mulab-mir / muchomusic
View on GitHub
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
☆46Dec 3, 2024Updated last year
RetroCirce / MusicLDM
View on GitHub
The latent diffusion model for text-to-music generation.
☆187Jan 26, 2024Updated 2 years ago
shansongliu / MuMu-LLaMA
View on GitHub
This is the official repository for M2UGen
☆513Jan 2, 2025Updated last year
sanderwood / melodyt5
View on GitHub
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆50Jan 23, 2025Updated last year
EmilianPostolache / stable-audio-controlnet
View on GitHub
Fine-tune Stable Audio Open with DiT ControlNet.
☆256May 16, 2025Updated last year
PeiChunChang / MS-SincResNet
View on GitHub
This paper has been accepted in ACM ICMR 2021.
☆20Nov 17, 2025Updated 8 months ago
OpenGVLab / LORIS
View on GitHub
[ICML2023] Long-Term Rhythmic Video Soundtracker
☆63Jul 28, 2025Updated 11 months ago
streichgeorg / autosing
View on GitHub
☆18Jan 20, 2025Updated last year
Kahsolt / TransTacoS-RetuneGAN
View on GitHub
A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.
☆15May 25, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
NEXTLab-ZJU / MelodyGLM
View on GitHub
☆13Sep 1, 2023Updated 2 years ago
AMAAI-Lab / JamendoMaxCaps
View on GitHub
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53May 24, 2025Updated last year
slSeanWU / MusDr
View on GitHub
Evaluation metrics for machine-composed symbolic music. Paper: "The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-…
☆64Oct 29, 2020Updated 5 years ago
XZWY / MSLDM
View on GitHub
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆29Sep 12, 2024Updated last year
fundwotsai2001 / MuseControlLite
View on GitHub
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
☆68Jan 6, 2026Updated 6 months ago
yukara-ikemiya / minimal-musicgen-for-developers
View on GitHub
[PyTorch] Minimal codebase for MusicGen models
☆63Jan 7, 2025Updated last year
Yuer867 / EMO_Harmonizer
View on GitHub
This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.
☆12Sep 25, 2024Updated last year