Unofficial download repository for MusicCaps
☆47 · Apr 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for music_caps_dl
Users that are interested in music_caps_dl are comparing it to the libraries listed below.
- million song dataset split for extended clean tag & artist-level stratified ☆52 · Aug 12, 2023 · Updated 2 years ago
- Download the MusicCaps dataset for music captioning ☆113 · Feb 11, 2025 · Updated last year
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆346 · Apr 8, 2024 · Updated last year
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", … ☆13 · Aug 26, 2022 · Updated 3 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio… ☆19 · Oct 20, 2025 · Updated 5 months ago
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.… ☆55 · Jan 18, 2024 · Updated 2 years ago
- MU-LLaMA: Music Understanding Large Language Model ☆305 · Aug 18, 2025 · Updated 7 months ago
- ☆87 · Jan 29, 2023 · Updated 3 years ago
- AudioLDM text to audio colab ☆19 · Nov 6, 2023 · Updated 2 years ago
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021) ☆85 · Dec 3, 2024 · Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆47 · Dec 3, 2024 · Updated last year
- ☆39 · Jan 9, 2026 · Updated 2 months ago
- Audio Embeddings as Teachers for Music Classification ☆13 · Sep 7, 2023 · Updated 2 years ago
- ☆17 · Nov 7, 2023 · Updated 2 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆47 · May 24, 2025 · Updated 10 months ago
- Open SingSong - Implementation of 'SingSong: Generating Musical Accompaniments from Singing' by Google Research, with a few modifications ☆16 · Jun 10, 2024 · Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆114 · Aug 12, 2023 · Updated 2 years ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning ☆52 · Jul 28, 2025 · Updated 8 months ago
- [PyTorch] Minimal codebase for MusicGen models ☆63 · Jan 7, 2025 · Updated last year
- ☆13 · Oct 3, 2023 · Updated 2 years ago
- Mustango: Toward Controllable Text-to-Music Generation ☆387 · Jun 2, 2025 · Updated 9 months ago
- ☆50 · Aug 27, 2024 · Updated last year
- A list of resources that can help in research for automated audio captioning ☆34 · Feb 17, 2021 · Updated 5 years ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings. ☆168 · Dec 22, 2023 · Updated 2 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆34 · Apr 22, 2024 · Updated last year
- ☆21 · Jul 15, 2024 · Updated last year
- Implementation of FiNS model for RIR estimation ☆38 · Nov 1, 2023 · Updated 2 years ago
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files! ☆32 · Jun 6, 2020 · Updated 5 years ago
- Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual ☆55 · Mar 22, 2026 · Updated last week
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation ☆89 · Apr 30, 2025 · Updated 11 months ago
- MIDI, WAV domain music emotion recognition [ISMIR 2021] ☆88 · Oct 29, 2021 · Updated 4 years ago
- Evaluation kit for the HEAR Benchmark ☆63 · Feb 12, 2026 · Updated last month
- Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, an… ☆378 · May 30, 2024 · Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆39 · Jan 6, 2024 · Updated 2 years ago
- Metadata, scripts and baselines for the MTG-Jamendo dataset ☆372 · Mar 18, 2026 · Updated last week
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025] ☆231 · May 11, 2025 · Updated 10 months ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings ☆56 · Jan 16, 2026 · Updated 2 months ago
- Implementation of the paper "Can Large Language Models Predict Audio Effects Parameters from Natural Language?" ☆29 · May 27, 2025 · Updated 10 months ago
- Self-supervised learning for real-time pitch estimation ☆282 · Oct 15, 2025 · Updated 5 months ago