AMAAI-Lab/Video2Music

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

☆196

Alternatives and similar repositories for Video2Music

Users who are interested in Video2Music are comparing it to the repositories listed below.

zhuole1025 / SymMV
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
☆78 • Mar 29, 2024 • Updated 2 years ago
ivyha010 / EmoMV
Datasets for affective music‑video retrieval
☆13 • Aug 21, 2022 • Updated 3 years ago
chouliuzuo / GVMGen
☆32 • Nov 10, 2025 • Updated 8 months ago
AMAAI-Lab / JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks
☆53 • May 24, 2025 • Updated last year
AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆394 • Jun 2, 2025 • Updated last year
AMAAI-Lab / MuVi
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses
☆22 • Oct 3, 2023 • Updated 2 years ago
AMAAI-Lab / t2m-inferalign
Improving Symbolic Music Generation with Inference-Time Alignment
☆22 • Aug 2, 2025 • Updated 11 months ago
seungheondoh / speech-to-music
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17 • Aug 16, 2023 • Updated 2 years ago
wzk1015 / video-bgm-generation
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
☆327 • Jun 8, 2025 • Updated last year
declare-lab / HyperTTS
☆40 • Apr 15, 2024 • Updated 2 years ago
andrebola / contrastive-mir-learning
This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"
☆15 • Jun 22, 2023 • Updated 3 years ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348 • Apr 8, 2024 • Updated 2 years ago
hugofloresgarcia / vampnet
music generation with masked transformers!
☆357 • May 16, 2025 • Updated last year
Tayjsl97 / MusER
This is the official implementation of MusER (AAAI'24).
☆31 • Jun 4, 2025 • Updated last year
AMAAI-Lab / DART
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16 • Apr 22, 2026 • Updated 3 months ago
mulab-mir / song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
☆175 • Dec 22, 2023 • Updated 2 years ago
guozixunnicolas / FundamentalMusicEmbedding
☆32 • Nov 25, 2023 • Updated 2 years ago
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆482 • May 25, 2025 • Updated last year
AMAAI-Lab / PreBit
This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…
☆12 • Jul 29, 2025 • Updated last year
AMAAI-Lab / MidiCaps
A large-scale dataset of caption-annotated MIDI files.
☆86 • Jul 23, 2024 • Updated 2 years ago
wazenmai / MIDI-BERT
[JCMS 2024] This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.
☆206 • Apr 10, 2024 • Updated 2 years ago
sizhelee / Diff-BGM
Official code for the CVPR'24 paper Diff-BGM
☆71 • Oct 12, 2024 • Updated last year
RetroCirce / Choral_Music_Separation
Chorale Music Separation Dataset and Model Framework
☆41 • Dec 5, 2022 • Updated 3 years ago
glory20h / VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
☆194 • Aug 9, 2024 • Updated last year
salu133445 / mmt
Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)
☆155 • Mar 14, 2024 • Updated 2 years ago
zxxwxyyy / sonique
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33 • Oct 8, 2024 • Updated last year
ETH-DISCO / blap
Official repo for BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning presented at ICASSP 2025
☆16 • Nov 18, 2024 • Updated last year
shansongliu / MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
☆306 • Aug 18, 2025 • Updated 11 months ago
legoodmanner / jukedrummer
☆39 • Mar 10, 2023 • Updated 3 years ago
joeljang / music2video
Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP
☆245 • Jun 10, 2022 • Updated 4 years ago
YatingMusic / MusiConGen
☆88 • Oct 20, 2024 • Updated last year
symphonynet / SymphonyNet
Symphony Generation with Permutation Invariant Language Model
☆256 • Oct 7, 2022 • Updated 3 years ago
haidog-yaqub / DiffPitcher
Diffusion-based singing voice pitch correction
☆144 • Sep 20, 2024 • Updated last year
saebyulpark / MCIC
Code and Dataset for <Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases, ISMIR 2024>
☆15 • Nov 12, 2024 • Updated last year
ZeyueT / VidMuse
[CVPR 2025] Repository of VidMuse
☆140 • Jun 7, 2025 • Updated last year
declare-lab / nora-1.5
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
☆110 • Jan 11, 2026 • Updated 6 months ago
minju0821 / musical_instrument_retrieval
☆29 • Jun 8, 2023 • Updated 3 years ago
Apple-jun / FilmComposer
Music production for silent film clips.
☆34 • Apr 30, 2025 • Updated last year
shinhyeokoh / rwen
☆14 • Jun 16, 2023 • Updated 3 years ago