Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
☆193Jul 30, 2024Updated last year
Alternatives and similar repositories for Video2Music
Users that are interested in Video2Music are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- ☆13Aug 21, 2022Updated 3 years ago
- ☆29Nov 10, 2025Updated 5 months ago
- Mustango: Toward Controllable Text-to-Music Generation☆385Jun 2, 2025Updated 10 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer☆322Jun 8, 2025Updated 10 months ago
- Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses☆22Oct 3, 2023Updated 2 years ago
- ☆39Apr 15, 2024Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆169Dec 22, 2023Updated 2 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆48May 24, 2025Updated 10 months ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆347Apr 8, 2024Updated 2 years ago
- music generation with masked transformers!☆349May 16, 2025Updated 11 months ago
- ☆25Apr 18, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the official implementation of MusER (AAAI'24).☆30Jun 4, 2025Updated 10 months ago
- ☆32Nov 25, 2023Updated 2 years ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆451May 25, 2025Updated 10 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆52Jul 28, 2025Updated 8 months ago
- A large-scale dataset of caption-annotated MIDI files.☆79Jul 23, 2024Updated last year
- [JCMS 2024] This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.☆202Apr 10, 2024Updated 2 years ago
- This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…☆12Jul 29, 2025Updated 8 months ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆38Mar 10, 2023Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)☆155Mar 14, 2024Updated 2 years ago
- MU-LLaMA: Music Understanding Large Language Model☆305Aug 18, 2025Updated 7 months ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- Symphony Generation with Permutation Invariant Language Model☆255Oct 7, 2022Updated 3 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆246Jun 10, 2022Updated 3 years ago
- ☆87Oct 20, 2024Updated last year
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆41Jan 17, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Diffusion-based singing voice pitch correction☆140Sep 20, 2024Updated last year
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 8 months ago
- Code and Dataset for <Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases, ISMIR 2024>☆14Nov 12, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Jun 11, 2024Updated last year
- ☆58Nov 2, 2020Updated 5 years ago
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls☆87Jul 16, 2024Updated last year