chouliuzuo/GVMGen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chouliuzuo/GVMGen)

chouliuzuo / GVMGen

☆32

Alternatives and similar repositories for GVMGen

Users that are interested in GVMGen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Apple-jun / FilmComposer
View on GitHub
Music production for silent film clips.
☆34Apr 30, 2025Updated last year
Littleor / Personalized-DMER
View on GitHub
Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…
☆14Mar 24, 2025Updated last year
wzk1015 / Awesome-Vision-to-Music-Generation
View on GitHub
[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.
☆126Aug 9, 2025Updated 11 months ago
wbs2788 / MTM
View on GitHub
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…
☆28Jan 21, 2025Updated last year
lsfhuihuiff / Dance-to-music_Siggraph_Asia_2024
View on GitHub
The official code for “Dance-to-Music Generation with Encoder-based Textual Inversion“
☆23Jun 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhuole1025 / SymMV
View on GitHub
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
☆78Mar 29, 2024Updated 2 years ago
zxxwxyyy / sonique
View on GitHub
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33Oct 8, 2024Updated last year
ZeyueT / VidMuse
View on GitHub
[CVPR 2025] Repository of VidMuse
☆140Jun 7, 2025Updated last year
ivyha010 / EmoMV
View on GitHub
Datasets for affective music‑video retrieval
☆13Aug 21, 2022Updated 3 years ago
jryban / frechet-music-distance
View on GitHub
A library for computing Frechet Music Distance.
☆31Feb 4, 2025Updated last year
crypto-code / Music-Representation-Comparison
View on GitHub
This is the repo with the code to conduct a comparative analysis of different audio representation models.
☆11Aug 31, 2023Updated 2 years ago
sizhelee / Diff-BGM
View on GitHub
official code for CVPR'24 paper Diff-BGM
☆71Oct 12, 2024Updated last year
schowdhury671 / melfusion
View on GitHub
☆58Oct 10, 2024Updated last year
AMAAI-Lab / Video2Music
View on GitHub
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
☆196Jul 30, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Xiaohao-Liu / Awesome-Vison2Audio
View on GitHub
A curated list of Vision (video/image) to Audio Generation
☆107Feb 10, 2026Updated 5 months ago
a43992899 / openl2s
View on GitHub
Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.
☆17May 9, 2025Updated last year
fundwotsai2001 / Text-to-music-dataset-preparation
View on GitHub
A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]
☆28May 20, 2025Updated last year
sander-wood / tunesformer
View on GitHub
TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching [HCMIR 2023]
☆51Sep 19, 2023Updated 2 years ago
Tayjsl97 / RL-Chord
View on GitHub
This is the official implementation of RL-Chord (TNNLS).
☆13Jan 2, 2024Updated 2 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
Tayjsl97 / MusER
View on GitHub
This is the official implementation of MusER (AAAI'24).
☆31Jun 4, 2025Updated last year
Ava4Everr / CodeHS-Java-APCSA
View on GitHub
Just a copy of https://github.com/RobynE23/CodeHS-Java-APCSA, but I added folders and some extra files that didn't exist. Another option …
☆27Jan 23, 2024Updated 2 years ago
TiffanyBlews / MozartsTouch
View on GitHub
Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
☆43Mar 17, 2026Updated 4 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
sakemin / cog-musicgen-fine-tuner
View on GitHub
This is a cog implementation of the fine-tuner for Meta's MusicGen
☆55Apr 5, 2024Updated 2 years ago
taegyeong-lee / Generating-Realistic-Images-from-In-the-wild-Sounds
View on GitHub
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
☆12Aug 24, 2025Updated 11 months ago
exeex / mimi
View on GitHub
a python library for midi to wav, generation, visualization, which is design for machine learning
☆11Mar 25, 2019Updated 7 years ago
AMAAI-Lab / JamendoMaxCaps
View on GitHub
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53May 24, 2025Updated last year
ETH-DISCO / blap
View on GitHub
Official repo for BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning presented at ICASSP 2025
☆16Nov 18, 2024Updated last year
lifeiteng / VoiceBox
View on GitHub
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29Aug 4, 2023Updated 2 years ago
fyyCS / LSLD
View on GitHub
☆14Nov 13, 2023Updated 2 years ago
OpenGVLab / LORIS
View on GitHub
[ICML2023] Long-Term Rhythmic Video Soundtracker
☆63Jul 28, 2025Updated last year
lsfhuihuiff / SongEcho_ICLR2026
View on GitHub
Official code for SongEcho
☆64Mar 3, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
swarupbehera / awesome-audio-visual-question-answering
View on GitHub
A curated list of resources in audio visual question answering and related area. :-)
☆17Jun 29, 2025Updated last year
Stability-AI / stable-audio-metrics
View on GitHub
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
☆300Updated this week
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 4 years ago
kaist-ami / Sound2Scene
View on GitHub
☆42Apr 14, 2025Updated last year
jorshi / drumblender
View on GitHub
Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.
☆35Jan 7, 2025Updated last year
LAION-AI / scaled-echo-tts
View on GitHub
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24Mar 29, 2026Updated 3 months ago
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆250May 11, 2025Updated last year