Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
☆85Dec 3, 2024Updated last year
Alternatives and similar repositories for muscaps
Users that are interested in muscaps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- ☆50Aug 27, 2024Updated last year
- Wave-U-Net for automatic (drum) mixing☆38Mar 24, 2023Updated 3 years ago
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)☆12Dec 17, 2021Updated 4 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆348Apr 8, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆435Nov 1, 2023Updated 2 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Jul 4, 2021Updated 4 years ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆22Oct 23, 2023Updated 2 years ago
- Official source codes of coco-mulla☆36Mar 21, 2024Updated 2 years ago
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆24Dec 12, 2022Updated 3 years ago
- [PyTorch] Minimal codebase for MusicGen models☆63Jan 7, 2025Updated last year
- MU-LLaMA: Music Understanding Large Language Model☆307Aug 18, 2025Updated 8 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆44Dec 3, 2024Updated last year
- ☆99Nov 25, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Audio Embeddings as Teachers for Music Classification☆13Sep 7, 2023Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆113Aug 12, 2023Updated 2 years ago
- ☆258Feb 14, 2024Updated 2 years ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆336Jul 25, 2024Updated last year
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆21Dec 3, 2021Updated 4 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 4 years ago
- Code of the paper "Byte Pair Encoding for Symbolic Music" (EMNLP 2023). Better and faster music generation☆45Mar 6, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The latent diffusion model for text-to-music generation.☆186Jan 26, 2024Updated 2 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆118Dec 13, 2021Updated 4 years ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Aug 12, 2022Updated 3 years ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆236May 11, 2025Updated 11 months ago
- Official Repository of Six Dragons Fly Again (ISMIR 2024)☆14Nov 13, 2025Updated 5 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"☆20Aug 18, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"☆36Feb 10, 2026Updated 2 months ago
- ☆18Jan 20, 2025Updated last year
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- music generation with masked transformers!☆351May 16, 2025Updated 11 months ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆43Oct 7, 2024Updated last year
- ☆31Mar 19, 2025Updated last year
- Unofficial download repository for MusicCaps☆47Apr 21, 2023Updated 3 years ago