seungheondoh/music_caps_dl

Unofficial download repository for MusicCaps

☆47

Alternatives and similar repositories for music_caps_dl

Users that are interested in music_caps_dl are comparing it to the libraries listed below.

seungheondoh / msd-subsets
million song dataset split for extended clean tag & artist-level stratified
☆52 · Aug 12, 2023 · Updated 2 years ago
nateraw / download-musiccaps-dataset
Download the MusicCaps dataset for music captioning
☆115 · May 19, 2026 · Updated 2 months ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348 · Apr 8, 2024 · Updated 2 years ago
christofw / multipitch_architectures
PyTorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …
☆15 · Aug 26, 2022 · Updated 3 years ago
0417keito / JEN-1-pytorch
Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.…
☆55 · Jan 18, 2024 · Updated 2 years ago
biboamy / music-repro
☆17 · Nov 7, 2023 · Updated 2 years ago
shansongliu / MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
☆306 · Aug 18, 2025 · Updated 11 months ago
yoyolicoris / music-spectrogram-diffusion-pytorch
☆88 · Jan 29, 2023 · Updated 3 years ago
ilaria-manco / muscaps
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
☆85 · Dec 3, 2024 · Updated last year
camenduru / audioldm-colab
AudioLDM text to audio colab
☆18 · Nov 6, 2023 · Updated 2 years ago
ilaria-manco / mulap
Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)
☆47 · Dec 3, 2024 · Updated last year
gladia-research-group / cocola
☆39 · Jan 9, 2026 · Updated 6 months ago
justivanr / art2mus_
Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…
☆20 · Oct 20, 2025 · Updated 9 months ago
suncerock / EAsT-music-classification
Audio Embeddings as Teachers for Music Classification
☆13 · Sep 7, 2023 · Updated 2 years ago
jihoojung0106 / open-singsong
Open SingSong - Implementation of 'SingSong: Generating Musical Accompaniments from Singing' by Google Research, with a few modifications
☆17 · Jun 10, 2024 · Updated 2 years ago
AMAAI-Lab / JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53 · May 24, 2025 · Updated last year
yukara-ikemiya / minimal-musicgen-for-developers
[PyTorch] Minimal codebase for MusicGen models
☆63 · Jan 7, 2025 · Updated last year
AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆394 · Jun 2, 2025 · Updated last year
seungheondoh / music-text-representation
Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]
☆113 · Aug 12, 2023 · Updated 2 years ago
sudongtan / synesthesia
☆13 · Oct 3, 2023 · Updated 2 years ago
york135 / MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35 · Apr 22, 2024 · Updated 2 years ago
audio-captioning / audio-captioning-resources
A list of resources that can help in research for automated audio captioning
☆34 · Feb 17, 2021 · Updated 5 years ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
kyungyunlee / fins
Implementation of FiNS model for RIR estimation
☆38 · Nov 1, 2023 · Updated 2 years ago
mulab-mir / song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
☆176 · Dec 22, 2023 · Updated 2 years ago
zihaod / MusiLingo
☆50 · Aug 27, 2024 · Updated last year
josephding23 / Free-Midi-Library
Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!
☆33 · Jun 6, 2020 · Updated 6 years ago
hearbenchmark / hear-eval-kit
Evaluation kit for the HEAR Benchmark
☆65 · Feb 12, 2026 · Updated 5 months ago
a43992899 / openl2s
Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.
☆17 · May 9, 2025 · Updated last year
mae-creative-pc / cpac_course_2024-25
☆13 · Dec 12, 2025 · Updated 7 months ago
seungheondoh / EMOPIA_cls
MIDI, WAV domain music emotion recognition [ISMIR 2021]
☆90 · Oct 29, 2021 · Updated 4 years ago
f90 / jamendolyrics
DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
☆88 · Apr 30, 2025 · Updated last year
spotify-research / llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, an…
☆383 · May 30, 2024 · Updated 2 years ago
polifonia-project / music-meta-ontology
A flexible ontology for the interoperability of music metadata
☆27 · Mar 22, 2024 · Updated 2 years ago
groupmm / libf0
A Python Library for Fundamental Frequency Estimation in Music Recordings
☆55 · Jun 5, 2026 · Updated last month
MTG / mtg-jamendo-dataset
Metadata, scripts and baselines for the MTG-Jamendo dataset
☆400 · Mar 18, 2026 · Updated 4 months ago
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41 · Jan 6, 2024 · Updated 2 years ago
zxxwxyyy / sonique
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33 · Oct 8, 2024 · Updated last year
dlrudco / Fast-Audioset-Download
Download AudioSet data quickly with youtube-dl, ffmpeg, and Python multiprocessing
☆48 · Aug 1, 2024 · Updated last year