schowdhury671/melfusion

☆58

Alternatives and similar repositories for melfusion

Users who are interested in melfusion are comparing it to the repositories listed below.

sizhelee / Diff-BGM
official code for CVPR'24 paper Diff-BGM
☆71 · Oct 12, 2024 · Updated last year
chouliuzuo / GVMGen
☆32 · Nov 10, 2025 · Updated 8 months ago
justivanr / art2mus_
Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…
☆20 · Oct 20, 2025 · Updated 9 months ago
ldzhangyx / MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49 · Sep 11, 2024 · Updated last year
luosiallen / Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
☆206 · May 29, 2024 · Updated 2 years ago
0417keito / JEN-1-pytorch
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…
☆55 · Jan 18, 2024 · Updated 2 years ago
cyanbx / Frieren-V2A
Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)
☆63 · Apr 3, 2025 · Updated last year
TiffanyBlews / MozartsTouch
Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
☆43 · Mar 17, 2026 · Updated 4 months ago
AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆394 · Jun 2, 2025 · Updated last year
schowdhury671 / meerkat
☆35 · Jul 9, 2025 · Updated last year
YatingMusic / MusiConGen
☆88 · Oct 20, 2024 · Updated last year
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
Text-to-Audio / Make-An-Audio-3
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
☆121 · May 19, 2025 · Updated last year
shansongliu / MuMu-LLaMA
This is the official repository for M2UGen
☆513 · Jan 2, 2025 · Updated last year
y-ren16 / OV-InstructTTS
☆22 · Jan 27, 2026 · Updated 6 months ago
samsad35 / code-ancogen
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14 · Mar 11, 2025 · Updated last year
shyammarjit / LR0.FM
LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models
☆16 · Aug 29, 2025 · Updated 11 months ago
seungheondoh / music_caps_dl
Unofficial download repository for MusicCaps
☆47 · Apr 21, 2023 · Updated 3 years ago
Eps-Acoustic-Revolution-Lab / EAR_HEAR
☆15 · Jan 9, 2026 · Updated 6 months ago
wjc2830 / MelQCD-main
☆32 · Mar 14, 2025 · Updated last year
wbs2788 / MTM
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…
☆28 · Jan 21, 2025 · Updated last year
ZeyueT / VidMuse
[CVPR 2025] Repository of VidMuse
☆140 · Jun 7, 2025 · Updated last year
gioannides / Density-Adaptive-JEPA
☆32 · Dec 7, 2025 · Updated 7 months ago
naver-ai / rewas
Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"
☆44 · Dec 13, 2024 · Updated last year
Suikasxt / PMG
The repository of paper Personalized Multimodal Response Generation with Large Language Models
☆18 · Jun 28, 2024 · Updated 2 years ago
yongaifadian1 / MNV-17
Qwen2.5-Omni fine-tuned on MNV-17 dataset for nonverbal vocalization recognition
☆31 · Nov 13, 2025 · Updated 8 months ago
RS2002 / Adversarial-MidiBERT
[ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …
☆19 · Aug 17, 2025 · Updated 11 months ago
v-iashin / SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
☆372 · Jul 12, 2024 · Updated 2 years ago
vgbench / VGBench
☆19 · Sep 19, 2024 · Updated last year
Pliploop / SLAP
Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding
☆63 · Sep 25, 2025 · Updated 10 months ago
nateraw / download-musiccaps-dataset
Download the MusicCaps dataset for music captioning
☆115 · May 19, 2026 · Updated 2 months ago
mbzuai-nlp / sttatts
☆31 · Oct 29, 2024 · Updated last year
astradzhao / music-rfm
Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…
☆40 · Oct 26, 2025 · Updated 9 months ago
Zhao-Yian / iSegMan
[CVPR 2025] iSegMan: Interactive Segment-and-Manipulate 3D Gaussians 🔥🔥🔥
☆23 · Mar 12, 2025 · Updated last year
triton99 / MDSGen
[ICLR'25] MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
☆39 · Dec 25, 2025 · Updated 7 months ago
Ava4Everr / CodeHS-Java-APCSA
Just a copy of https://github.com/RobynE23/CodeHS-Java-APCSA, but I added folders and some extra files that didn't exist. Another option …
☆27 · Jan 23, 2024 · Updated 2 years ago
Dongjiahua / VICA-NeRF
☆42 · Aug 16, 2024 · Updated last year
taegyeong-lee / Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
☆12 · Aug 24, 2025 · Updated 11 months ago
Apple-jun / FilmComposer
Music production for silent film clips.
☆34 · Apr 30, 2025 · Updated last year