nateraw/download-musiccaps-dataset

Download the MusicCaps dataset for music captioning

☆115

Alternatives and similar repositories for download-musiccaps-dataset

Users who are interested in download-musiccaps-dataset compare it to the repositories listed below.

seungheondoh / music_caps_dl
Unofficial download repository for MusicCaps
☆47 · Apr 21, 2023 · Updated 3 years ago
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37 · Apr 25, 2023 · Updated 3 years ago
sanderwood / melodyt5
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆50 · Jan 23, 2025 · Updated last year
josephding23 / Free-Midi-Library
Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!
☆33 · Jun 6, 2020 · Updated 6 years ago
seungheondoh / music-text-representation
Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]
☆113 · Aug 12, 2023 · Updated 2 years ago
0417keito / JEN-1-pytorch
Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.…
☆55 · Jan 18, 2024 · Updated 2 years ago
seungheondoh / msd-subsets
Million Song Dataset split for extended clean tag & artist-level stratified
☆52 · Aug 12, 2023 · Updated 2 years ago
frankenliu / LOAE
☆10 · Sep 25, 2024 · Updated last year
zhvng / open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
☆560 · Jun 3, 2023 · Updated 3 years ago
gudgud96 / frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
☆317 · Feb 11, 2026 · Updated 5 months ago
archinetai / audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.
☆144 · Feb 11, 2023 · Updated 3 years ago
karchkha / MSG-LD
Official repository for: Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
☆19 · Nov 21, 2025 · Updated 8 months ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348 · Apr 8, 2024 · Updated 2 years ago
Kikyo-16 / coco-mulla-repo
Official source codes of coco-mulla
☆36 · Mar 21, 2024 · Updated 2 years ago
zihaod / MusiLingo
☆50 · Aug 27, 2024 · Updated last year
primepake / F5-TTS-meanflow-multilingual
Meanflow and multilingual for F5-TTS model
☆16 · Aug 23, 2025 · Updated 11 months ago
yoyolicoris / music-spectrogram-diffusion-pytorch
☆88 · Jan 29, 2023 · Updated 3 years ago
zeyuxie29 / AudioTime
☆39 · Jul 4, 2024 · Updated 2 years ago
shansongliu / MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
☆306 · Aug 18, 2025 · Updated 11 months ago
Aratako / CALM-DACVAE
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆18 · Feb 20, 2026 · Updated 5 months ago
archinetai / audio-diffusion-pytorch-trainer
Trainer for audio-diffusion-pytorch
☆129 · Jan 13, 2023 · Updated 3 years ago
Barbany / Multi-speaker-Neural-Vocoder
Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial fulfillment of the requirements for the degree in Telecommun…
☆16 · Jul 25, 2024 · Updated 2 years ago
mubtasimahasan / DM-Codec
Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”
☆57 · Jun 1, 2025 · Updated last year
archinetai / archisound
A collection of pre-trained audio models, in PyTorch.
☆116 · Jan 27, 2023 · Updated 3 years ago
chomeyama / SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
☆275 · Jul 29, 2023 · Updated 3 years ago
MTG / mtg-jamendo-dataset
Metadata, scripts and baselines for the MTG-Jamendo dataset
☆400 · Mar 18, 2026 · Updated 4 months ago
wsntxxn / UniFlow-Audio
☆74 · Jul 17, 2026 · Updated last week
shansongliu / MuMu-LLaMA
This is the official repository for M2UGen
☆513 · Jan 2, 2025 · Updated last year
sander-wood / text-to-music
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music Generation Task [AAAI 2023 Workshop]
☆79 · Aug 20, 2023 · Updated 2 years ago
facebookresearch / lst
Code for Latent Speech-Text Transformer (LST)
☆35 · Mar 12, 2026 · Updated 4 months ago
yukara-ikemiya / friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆218 · Jul 25, 2024 · Updated 2 years ago
KdaiP / DC-Speech-VAE
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57 · Nov 19, 2025 · Updated 8 months ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20 · May 12, 2023 · Updated 3 years ago
ldzhangyx / instruct-MusicGen
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…
☆109 · Jan 14, 2026 · Updated 6 months ago
Kinyugo / msanii
A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.
☆196 · Apr 27, 2023 · Updated 3 years ago
yangdongchao / LLM-Codec
The open source code for LLM-Codec
☆147 · Aug 18, 2024 · Updated last year
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆747 · Jun 5, 2025 · Updated last year
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆22 · Aug 6, 2023 · Updated 2 years ago
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆482 · May 25, 2025 · Updated last year