Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
☆193Jul 30, 2024Updated last year
Alternatives and similar repositories for Video2Music
Users that are interested in Video2Music are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- ☆13Aug 21, 2022Updated 3 years ago
- ☆31Nov 10, 2025Updated 6 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆15Apr 22, 2026Updated last month
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆51May 24, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Mustango: Toward Controllable Text-to-Music Generation☆391Jun 2, 2025Updated 11 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- [ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer☆324Jun 8, 2025Updated 11 months ago
- Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses☆22Oct 3, 2023Updated 2 years ago
- ☆40Apr 15, 2024Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆171Dec 22, 2023Updated 2 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆348Apr 8, 2024Updated 2 years ago
- music generation with masked transformers!☆352May 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆26Apr 18, 2025Updated last year
- This is the official implementation of MusER (AAAI'24).☆30Jun 4, 2025Updated 11 months ago
- ☆32Nov 25, 2023Updated 2 years ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆461May 25, 2025Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆53Jul 28, 2025Updated 10 months ago
- A large-scale dataset of caption-annotated MIDI files.☆84Jul 23, 2024Updated last year
- [JCMS 2024] This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.☆205Apr 10, 2024Updated 2 years ago
- This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…☆12Jul 29, 2025Updated 10 months ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- ☆39Mar 10, 2023Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)☆155Mar 14, 2024Updated 2 years ago
- MU-LLaMA: Music Understanding Large Language Model☆306Aug 18, 2025Updated 9 months ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- Symphony Generation with Permutation Invariant Language Model☆255Oct 7, 2022Updated 3 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆245Jun 10, 2022Updated 3 years ago
- ☆87Oct 20, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆43Jan 17, 2026Updated 4 months ago
- Diffusion-based singing voice pitch correction☆141Sep 20, 2024Updated last year
- Improving Symbolic Music Generation with Inference-Time Alignment☆22Aug 2, 2025Updated 9 months ago
- Code and Dataset for <Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases, ISMIR 2024>☆15Nov 12, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆18Jan 18, 2025Updated last year
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Jun 11, 2024Updated last year