State-of-the-art pretrained music models for training, evaluation, and inference
☆163 · Jan 20, 2026 · Updated last month
Alternatives and similar repositories for MARBLE
Users who are interested in MARBLE compare it to the libraries listed below.
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training". ☆434 · May 25, 2025 · Updated 9 months ago
- ☆251 · Feb 14, 2024 · Updated 2 years ago
- A simple library for Fréchet Audio Distance (FAD) calculation ☆246 · Aug 22, 2025 · Updated 6 months ago
- MU-LLaMA: Music Understanding Large Language Model ☆303 · Aug 18, 2025 · Updated 6 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆101 · Jul 24, 2024 · Updated last year
- Code for ChordSync, a conformer-based audio-to-chord synchroniser ☆13 · Oct 17, 2025 · Updated 4 months ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification" ☆82 · Nov 7, 2025 · Updated 3 months ago
- A Representation Evaluation Framework for Music Information Retrieval tasks ☆53 · Apr 9, 2024 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆33 · Apr 22, 2024 · Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆38 · Jan 6, 2024 · Updated 2 years ago
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization". ☆311 · Aug 4, 2025 · Updated 6 months ago
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system. ☆47 · Dec 4, 2025 · Updated 3 months ago
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024. ☆40 · Jan 17, 2026 · Updated last month
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT ☆53 · Nov 20, 2023 · Updated 2 years ago
- All-In-One Music Structure Analyzer ☆722 · May 9, 2024 · Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆37 · Jun 24, 2025 · Updated 8 months ago
- ISMIR 2023 Papers: A complete collection of influential and exciting research papers from the ISMIR 2023 conference. ☆106 · Dec 2, 2023 · Updated 2 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆344 · Apr 8, 2024 · Updated last year
- ☆18 · May 4, 2025 · Updated 9 months ago
- Mustango: Toward Controllable Text-to-Music Generation ☆386 · Jun 2, 2025 · Updated 9 months ago
- Training, validation, and inference code for various SSL approaches and architectures. ☆79 · Oct 22, 2025 · Updated 4 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations ☆335 · Jul 25, 2024 · Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆136 · Feb 23, 2026 · Updated last week
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆100 · Feb 20, 2026 · Updated last week
- A standardized toolkit of Kernel Audio Distance (KAD), a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆95 · Jun 12, 2025 · Updated 8 months ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit ☆43 · Aug 29, 2024 · Updated last year
- ☆130 · Feb 9, 2026 · Updated 3 weeks ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆48 · Jan 19, 2026 · Updated last month
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 · Dec 12, 2024 · Updated last year
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings. ☆167 · Dec 22, 2023 · Updated 2 years ago
- Results and Models for Learning Audio Representations of Music Content ☆107 · Dec 3, 2024 · Updated last year
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows ☆124 · Sep 2, 2025 · Updated 6 months ago
- Improving Symbolic Music Generation with Inference-Time Alignment ☆20 · Aug 2, 2025 · Updated 7 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025] ☆220 · May 11, 2025 · Updated 9 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models. ☆44 · Dec 3, 2024 · Updated last year
- Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation" ☆25 · Feb 3, 2026 · Updated last month
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models ☆22 · Jul 10, 2024 · Updated last year
- PyTorch Dataset for Speech and Music audio ☆80 · Jul 12, 2024 · Updated last year
- Metadata, scripts and baselines for the MTG-Jamendo dataset ☆365 · Jan 13, 2026 · Updated last month