ZeyueT/VidMuse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZeyueT/VidMuse)

ZeyueT / VidMuse

[CVPR 2025] Repository of VidMuse

☆140

Alternatives and similar repositories for VidMuse

Users that are interested in VidMuse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sizhelee / Diff-BGM
View on GitHub
official code for CVPR'24 paper Diff-BGM
☆71Oct 12, 2024Updated last year
zhuole1025 / SymMV
View on GitHub
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
☆78Mar 29, 2024Updated 2 years ago
chouliuzuo / GVMGen
View on GitHub
☆32Nov 10, 2025Updated 8 months ago
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49Sep 11, 2024Updated last year
wzk1015 / Awesome-Vision-to-Music-Generation
View on GitHub
[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.
☆126Aug 9, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Apple-jun / FilmComposer
View on GitHub
Music production for silent film clips.
☆34Apr 30, 2025Updated last year
yongyizang / AreYouReallyListening
View on GitHub
Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"
☆20Aug 18, 2025Updated 11 months ago
happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194Mar 25, 2024Updated 2 years ago
wzk1015 / video-bgm-generation
View on GitHub
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
☆327Jun 8, 2025Updated last year
ilpoviertola / V-AURA
View on GitHub
The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)
☆35Feb 11, 2026Updated 5 months ago
v-iashin / Synchformer
View on GitHub
Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)
☆130Sep 15, 2025Updated 10 months ago
zxxwxyyy / sonique
View on GitHub
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33Oct 8, 2024Updated last year
Xiaohao-Liu / Awesome-Vison2Audio
View on GitHub
A curated list of Vision (video/image) to Audio Generation
☆107Feb 10, 2026Updated 5 months ago
shansongliu / MuMu-LLaMA
View on GitHub
This is the official repository for M2UGen
☆513Jan 2, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yuhui1038 / Muse
View on GitHub
ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
☆119Apr 11, 2026Updated 3 months ago
fundwotsai2001 / Text-to-Music_control_family
View on GitHub
Containing SOTA methods that follows time-varying conditions for Text-to-Music
☆24Jan 1, 2026Updated 6 months ago
fundwotsai2001 / MuseControlLite
View on GitHub
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
☆68Jan 6, 2026Updated 6 months ago
suncerock / EAsT-music-classification
View on GitHub
Audio Embeddings as Teachers for Music Classification
☆13Sep 7, 2023Updated 2 years ago
jnwnlee / video-foley
View on GitHub
Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20…
☆19Feb 27, 2026Updated 5 months ago
LiuZH-19 / SongGen
View on GitHub
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆315Nov 5, 2025Updated 8 months ago
wbs2788 / MTM
View on GitHub
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…
☆28Jan 21, 2025Updated last year
ariesssxu / vta-ldm
View on GitHub
☆61Jun 15, 2025Updated last year
PolyPerceiver-Lab / STAV2A
View on GitHub
☆20Aug 11, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
haoheliu / AudioLDM-training-finetuning
View on GitHub
AudioLDM training, finetuning, evaluation and inference.
☆304Dec 13, 2024Updated last year
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆360Aug 4, 2025Updated 11 months ago
streichgeorg / autosing
View on GitHub
☆18Jan 20, 2025Updated last year
open-mmlab / FoleyCrafter
View on GitHub
[IJCV 2026] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师，给你的无声视频添加生动而且同步的音效 😝
☆658Jun 15, 2026Updated last month
QwenAudio / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,371May 20, 2025Updated last year
ZeyueT / AudioX
View on GitHub
[ICLR 2026] Repository of AudioX
☆1,544Mar 10, 2026Updated 4 months ago
seungheondoh / lp-music-caps
View on GitHub
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348Apr 8, 2024Updated 2 years ago
yzxing87 / Seeing-and-Hearing
View on GitHub
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
☆155Jul 6, 2024Updated 2 years ago
kaist-ami / SoundBrush
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
OpenGVLab / LORIS
View on GitHub
[ICML2023] Long-Term Rhythmic Video Soundtracker
☆63Jul 28, 2025Updated last year
hkchengrex / av-benchmark
View on GitHub
Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…
☆80Feb 14, 2026Updated 5 months ago
juhayna-zh / AudioControlNet
View on GitHub
Official repository for the paper "Audio ControlNet for Fine-Grained Audio Generation and Editing".
☆77Feb 7, 2026Updated 5 months ago
NKU-HLT / AudioEditor
View on GitHub
☆47Apr 2, 2025Updated last year
chenjianyi / fastsag
View on GitHub
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
☆29Dec 19, 2024Updated last year
dennisvdang / chorus-detection
View on GitHub
A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input…
☆48Jul 15, 2026Updated last week
juhayna-zh / Awesome-Music-Generation-Papers
View on GitHub
Curated list of groundbreaking music generation research.
☆21Apr 24, 2026Updated 3 months ago