shansongliu/MuMu-LLaMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shansongliu/MuMu-LLaMA)

shansongliu / MuMu-LLaMA

This is the official repository for M2UGen

☆513

Alternatives and similar repositories for MuMu-LLaMA

Users that are interested in MuMu-LLaMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shansongliu / MU-LLaMA
View on GitHub
MU-LLaMA: Music Understanding Large Language Model
☆306Aug 18, 2025Updated 11 months ago
yizhilll / MERT
View on GitHub
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆481May 25, 2025Updated last year
AMAAI-Lab / mustango
View on GitHub
Mustango: Toward Controllable Text-to-Music Generation
☆394Jun 2, 2025Updated last year
0417keito / JEN-1-pytorch
View on GitHub
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…
☆55Jan 18, 2024Updated 2 years ago
seungheondoh / lp-music-caps
View on GitHub
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348Apr 8, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
ldzhangyx / instruct-MusicGen
View on GitHub
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…
☆109Jan 14, 2026Updated 6 months ago
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆359Aug 4, 2025Updated 11 months ago
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆250May 11, 2025Updated last year
bytedance / Make-An-Audio-2
View on GitHub
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆197May 29, 2024Updated 2 years ago
RetroCirce / MusicLDM
View on GitHub
The latent diffusion model for text-to-music generation.
☆187Jan 26, 2024Updated 2 years ago
minzwon / musicfm
View on GitHub
☆268Feb 14, 2024Updated 2 years ago
QwenAudio / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,371May 20, 2025Updated last year
spotify-research / llark
View on GitHub
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, an…
☆384May 30, 2024Updated 2 years ago
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
happylittlecat2333 / Auffusion
View on GitHub
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…
☆194Mar 25, 2024Updated 2 years ago
facebookresearch / audiobox-aesthetics
View on GitHub
Unified automatic quality assessment for speech, music, and sound.
☆745Jun 5, 2025Updated last year
Audio-AGI / WavJourney
View on GitHub
WavJourney: Compositional Audio Creation with LLMs
☆544Sep 28, 2023Updated 2 years ago
yangdongchao / UniAudio
View on GitHub
The Open Source Code of UniAudio
☆605Jul 22, 2024Updated 2 years ago
ZeyueT / VidMuse
View on GitHub
[CVPR 2025] Repository of VidMuse
☆140Jun 7, 2025Updated last year
Stability-AI / stable-audio-metrics
View on GitHub
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
☆300Updated this week
hf-lin / ChatMusician
View on GitHub
☆316Apr 24, 2024Updated 2 years ago
sizhelee / Diff-BGM
View on GitHub
official code for CVPR'24 paper Diff-BGM
☆71Oct 12, 2024Updated last year
jryban / frechet-music-distance
View on GitHub
A library for computing Frechet Music Distance.
☆31Feb 4, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Text-to-Audio / Make-An-Audio-3
View on GitHub
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
☆121May 19, 2025Updated last year
hugofloresgarcia / vampnet
View on GitHub
music generation with masked transformers!
☆357May 16, 2025Updated last year
affige / genmusic_demo_list
View on GitHub
a list of demo websites for automatic music generation research
☆791Jul 4, 2026Updated 3 weeks ago
TiffanyBlews / MozartsTouch
View on GitHub
Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
☆43Mar 17, 2026Updated 4 months ago
haoheliu / AudioLDM-training-finetuning
View on GitHub
AudioLDM training, finetuning, evaluation and inference.
☆304Dec 13, 2024Updated last year
ylacombe / musicgen-dreamboothing
View on GitHub
Fine-tune your own MusicGen with LoRA
☆161Apr 26, 2024Updated 2 years ago
declare-lab / tango
View on GitHub
A family of diffusion models for text-to-audio generation.
☆1,239Jul 29, 2025Updated 11 months ago
haoheliu / AudioLDM2
View on GitHub
Text-to-Audio/Music Generation
☆2,636Sep 29, 2024Updated last year
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49Sep 11, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
liusongxiang / Large-Audio-Models
View on GitHub
Keep track of big models in audio domain, including speech, singing, music etc.
☆515Jul 3, 2026Updated 3 weeks ago
microsoft / muzic
View on GitHub
Muzic: Music Understanding and Generation with Artificial Intelligence
☆4,937Oct 12, 2024Updated last year
fundwotsai2001 / MuseControlLite
View on GitHub
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
☆68Jan 6, 2026Updated 6 months ago
Stability-AI / stable-audio-tools
View on GitHub
Generative models for conditional audio generation
☆3,826Updated this week
LAION-AI / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆2,229May 15, 2025Updated last year
mir-aidj / all-in-one
View on GitHub
All-In-One Music Structure Analyzer
☆808May 9, 2024Updated 2 years ago
yongyizang / AreYouReallyListening
View on GitHub
Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"
☆20Aug 18, 2025Updated 11 months ago