spotify-research/llark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/spotify-research/llark)

spotify-research / llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

☆384

Alternatives and similar repositories for llark

Users that are interested in llark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shansongliu / MU-LLaMA
View on GitHub
MU-LLaMA: Music Understanding Large Language Model
☆306Aug 18, 2025Updated 11 months ago
minzwon / musicfm
View on GitHub
☆268Feb 14, 2024Updated 2 years ago
seungheondoh / lp-music-caps
View on GitHub
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348Apr 8, 2024Updated 2 years ago
yizhilll / MERT
View on GitHub
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆481May 25, 2025Updated last year
mulab-mir / song-describer-dataset
View on GitHub
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
☆175Dec 22, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
habla-liaa / encodecmae
View on GitHub
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
☆101Jul 24, 2024Updated 2 years ago
zihaod / MusiLingo
View on GitHub
☆50Aug 27, 2024Updated last year
p-lambda / jukemir
View on GitHub
Perform transfer learning for MIR using Jukebox!
☆188Oct 12, 2023Updated 2 years ago
mir-dataset-loaders / mirdata
View on GitHub
Python library for working with Music Information Retrieval datasets
☆412Jul 14, 2026Updated last week
Audio-AGI / WavJourney
View on GitHub
WavJourney: Compositional Audio Creation with LLMs
☆544Sep 28, 2023Updated 2 years ago
affige / genmusic_demo_list
View on GitHub
a list of demo websites for automatic music generation research
☆791Jul 4, 2026Updated 3 weeks ago
Natooz / MidiTok
View on GitHub
MIDI / symbolic music tokenizers for Deep Learning models 🎶
☆884Updated this week
ldzhangyx / BART-fusion
View on GitHub
The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".
☆24Dec 12, 2022Updated 3 years ago
mir-aidj / all-in-one
View on GitHub
All-In-One Music Structure Analyzer
☆808May 9, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
minzwon / semi-supervised-music-tagging-transformer
View on GitHub
☆99Nov 25, 2021Updated 4 years ago
LAION-AI / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆2,229May 15, 2025Updated last year
ilaria-manco / muscall
View on GitHub
Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)
☆122Dec 5, 2024Updated last year
ilaria-manco / multimodal-ml-music
View on GitHub
List of academic resources on Multimodal ML for Music
☆298Mar 25, 2023Updated 3 years ago
hugofloresgarcia / vampnet
View on GitHub
music generation with masked transformers!
☆357May 16, 2025Updated last year
RetroCirce / MusicLDM
View on GitHub
The latent diffusion model for text-to-music generation.
☆187Jan 26, 2024Updated 2 years ago
minzwon / sota-music-tagging-models
View on GitHub
☆440Nov 1, 2023Updated 2 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
gudgud96 / frechet-audio-distance
View on GitHub
A lightweight library for Frechet Audio Distance calculation.
☆317Feb 11, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
WildHoneyPie / BEAST
View on GitHub
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…
☆44Sep 11, 2024Updated last year
microsoft / fadtk
View on GitHub
A simple library for Fréchet Audio Distance (FAD) calculation
☆266Aug 22, 2025Updated 11 months ago
seungheondoh / msd-subsets
View on GitHub
million song dataset split for extended clean tag & artist-level stratified
☆52Aug 12, 2023Updated 2 years ago
seungheondoh / music-text-representation-pp
View on GitHub
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
☆43Oct 7, 2024Updated last year
affige / DeepMIR
View on GitHub
Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University
☆237Dec 1, 2025Updated 7 months ago
ldzhangyx / simplified-jukemir
View on GitHub
A minimum JukeMIR branch for feature extraction.
☆32Mar 31, 2022Updated 4 years ago
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆250May 11, 2025Updated last year
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆359Aug 4, 2025Updated 11 months ago
Hayeonbang / PIAST
View on GitHub
A piano music dataset with Audio, Symbolic and Text labels
☆36Mar 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AMAAI-Lab / mustango
View on GitHub
Mustango: Toward Controllable Text-to-Music Generation
☆394Jun 2, 2025Updated last year
a43992899 / MARBLE
View on GitHub
State-of-the-art pretrained music models for training, evaluation, inference
☆183Jan 20, 2026Updated 6 months ago
ilaria-manco / song-describer
View on GitHub
Song Describer is a data collection platform for annotating music with textual descriptions.
☆61Dec 3, 2024Updated last year
groupmm / synctoolbox
View on GitHub
A Python toolbox with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (…
☆138May 28, 2026Updated last month
Stability-AI / stable-audio-metrics
View on GitHub
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
☆300Updated this week
nii-yamagishilab / midi-to-audio
View on GitHub
Project for MIDI to Audio Synthesis
☆28Mar 13, 2023Updated 3 years ago
NVIDIA / audio-flamingo
View on GitHub
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
☆1,160Dec 15, 2025Updated 7 months ago