ilaria-manco/muscaps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ilaria-manco/muscaps)

ilaria-manco / muscaps

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

☆85

Alternatives and similar repositories for muscaps

Users that are interested in muscaps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ilaria-manco / mulap
View on GitHub
Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)
☆47Dec 3, 2024Updated last year
zihaod / MusiLingo
View on GitHub
☆50Aug 27, 2024Updated last year
cyrusasfa / timbre2020-sounds
View on GitHub
Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)
☆12Dec 17, 2021Updated 4 years ago
seungheondoh / lp-music-caps
View on GitHub
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348Apr 8, 2024Updated 2 years ago
ws-choi / AMSS-Net
View on GitHub
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21Jul 4, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZacharyNovack / Lead-AE
View on GitHub
Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression
☆22Oct 23, 2023Updated 2 years ago
minzwon / sota-music-tagging-models
View on GitHub
☆439Nov 1, 2023Updated 2 years ago
Kikyo-16 / coco-mulla-repo
View on GitHub
Official source codes of coco-mulla
☆36Mar 21, 2024Updated 2 years ago
f90 / Mix-Wave-U-Net
View on GitHub
Wave-U-Net for automatic (drum) mixing
☆38Mar 24, 2023Updated 3 years ago
ldzhangyx / BART-fusion
View on GitHub
The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".
☆24Dec 12, 2022Updated 3 years ago
yukara-ikemiya / minimal-musicgen-for-developers
View on GitHub
[PyTorch] Minimal codebase for MusicGen models
☆63Jan 7, 2025Updated last year
shansongliu / MU-LLaMA
View on GitHub
MU-LLaMA: Music Understanding Large Language Model
☆306Aug 18, 2025Updated 11 months ago
minzwon / semi-supervised-music-tagging-transformer
View on GitHub
☆99Nov 25, 2021Updated 4 years ago
suncerock / EAsT-music-classification
View on GitHub
Audio Embeddings as Teachers for Music Classification
☆13Sep 7, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
guozixunnicolas / FundamentalMusicEmbedding
View on GitHub
☆32Nov 25, 2023Updated 2 years ago
seungheondoh / music-text-representation
View on GitHub
Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]
☆113Aug 12, 2023Updated 2 years ago
janzuiderveld / continuous-audio-representations
View on GitHub
Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".
☆21Dec 3, 2021Updated 4 years ago
mulab-mir / muchomusic
View on GitHub
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
☆46Dec 3, 2024Updated last year
aim-qmul / sdx23-aimless
View on GitHub
Source Separation training codebase for the Sound Demixing Challenge 2023.
☆45May 18, 2023Updated 3 years ago
ws-choi / LASAFT-Net-v2
View on GitHub
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Apr 11, 2022Updated 4 years ago
Spijkervet / CLMR
View on GitHub
Official PyTorch implementation of Contrastive Learning of Musical Representations
☆338Jul 25, 2024Updated 2 years ago
aframires / freesound-loop-annotator
View on GitHub
A web app for annotating Freesound loops, and the tools to analyse the dataset created.
☆20Jul 6, 2023Updated 3 years ago
minzwon / musicfm
View on GitHub
☆269Feb 14, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hugofloresgarcia / music-trees
View on GitHub
Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…
☆41Aug 12, 2022Updated 3 years ago
RetroCirce / MusicLDM
View on GitHub
The latent diffusion model for text-to-music generation.
☆188Jan 26, 2024Updated 2 years ago
emirdemirel / ASA_ICASSP2021
View on GitHub
A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…
☆15Oct 13, 2022Updated 3 years ago
andrebola / contrastive-mir-learning
View on GitHub
This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"
☆15Jun 22, 2023Updated 3 years ago
ilaria-manco / word2wave
View on GitHub
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
☆118Dec 13, 2021Updated 4 years ago
seungheondoh / speech-to-music
View on GitHub
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17Aug 16, 2023Updated 2 years ago
Natooz / BPE-Symbolic-Music
View on GitHub
Code of the paper "Byte Pair Encoding for Symbolic Music" (EMNLP 2023). Better and faster music generation
☆45Mar 6, 2024Updated 2 years ago
MALerLab / SejongMusic
View on GitHub
Official Repository of Six Dragons Fly Again (ISMIR 2024)
☆15Nov 13, 2025Updated 8 months ago
yongyizang / AreYouReallyListening
View on GitHub
Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"
☆20Aug 18, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hugofloresgarcia / vampnet
View on GitHub
music generation with masked transformers!
☆357May 16, 2025Updated last year
ismir-24-sub / unsupervised_compositional_representations
View on GitHub
ISMIR 24 Supplementary Material
☆14Oct 28, 2024Updated last year
madhavlab / wav2tok
View on GitHub
Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36Jun 30, 2026Updated 3 weeks ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
seungheondoh / music_caps_dl
View on GitHub
Unofficial download repository for MusicCaps
☆47Apr 21, 2023Updated 3 years ago
mdx-workshop / mdx-submissions21
View on GitHub
Music Demixing Challenge Submission Repo
☆16Sep 8, 2023Updated 2 years ago
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆250May 11, 2025Updated last year