☆29Nov 10, 2025Updated 5 months ago
Alternatives and similar repositories for GVMGen
Users that are interested in GVMGen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.☆121Aug 9, 2025Updated 8 months ago
- The official code for “Dance-to-Music Generation with Encoder-based Textual Inversion“☆22Jun 17, 2025Updated 9 months ago
- Music production for silent film clips.☆32Apr 30, 2025Updated 11 months ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Aug 21, 2022Updated 3 years ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- A library for computing Frechet Music Distance.☆31Feb 4, 2025Updated last year
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆193Jul 30, 2024Updated last year
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- ☆57Oct 10, 2024Updated last year
- A curated list of Vision (video/image) to Audio Generation☆105Feb 10, 2026Updated 2 months ago
- ☆127Jun 7, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching [HCMIR 2023]☆51Sep 19, 2023Updated 2 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 10 months ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- This is the official implementation of RL-Chord (TNNLS).☆13Jan 2, 2024Updated 2 years ago
- This is the official implementation of MusER (AAAI'24).☆30Jun 4, 2025Updated 10 months ago
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆43Mar 17, 2026Updated 3 weeks ago
- Just a copy of https://github.com/RobynE23/CodeHS-Java-APCSA, but I added folders and some extra files that didn't exist. Another option …☆27Jan 23, 2024Updated 2 years ago
- A curated list of resources in audio visual question answering and related area. :-)☆17Jun 29, 2025Updated 9 months ago
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is a cog implementation of the fine-tuner for Meta's MusicGen☆54Apr 5, 2024Updated 2 years ago
- [CVPR 2025] Plug-and-Play Versatile Compressed Video Enhancement☆22Jan 19, 2026Updated 2 months ago
- ☆68Dec 30, 2025Updated 3 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆48May 24, 2025Updated 10 months ago
- ☆38Dec 18, 2025Updated 3 months ago
- a python library for midi to wav, generation, visualization, which is design for machine learning☆11Mar 25, 2019Updated 7 years ago
- Official code for SongEcho☆55Mar 3, 2026Updated last month
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆62Jul 28, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆232May 11, 2025Updated 11 months ago
- ☆40Apr 14, 2025Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆48Dec 16, 2025Updated 4 months ago
- A list of papers and other resources on deep learning with anime style images.☆17Feb 8, 2018Updated 8 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 10 months ago
- Materialist: Physically Based Editing Using Single-Image Inverse Rendering☆26Oct 24, 2025Updated 5 months ago
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.☆32Jan 7, 2025Updated last year