Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
☆85 Dec 3, 2024 Updated last year
Alternatives and similar repositories for muscaps
Users that are interested in muscaps are comparing it to the libraries listed below.
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆47 Dec 3, 2024 Updated last year
- ☆50 Aug 27, 2024 Updated last year
- Wave-U-Net for automatic (drum) mixing ☆38 Mar 24, 2023 Updated 3 years ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23] ☆346 Apr 8, 2024 Updated last year
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020) ☆12 Dec 17, 2021 Updated 4 years ago
- ☆437 Nov 1, 2023 Updated 2 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021… ☆21 Jul 4, 2021 Updated 4 years ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression ☆22 Oct 23, 2023 Updated 2 years ago
- Official source codes of coco-mulla ☆36 Mar 21, 2024 Updated 2 years ago
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model". ☆24 Dec 12, 2022 Updated 3 years ago
- [PyTorch] Minimal codebase for MusicGen models ☆63 Jan 7, 2025 Updated last year
- MU-LLaMA: Music Understanding Large Language Model ☆305 Aug 18, 2025 Updated 7 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models. ☆44 Dec 3, 2024 Updated last year
- ☆99 Nov 25, 2021 Updated 4 years ago
- Audio Embeddings as Teachers for Music Classification ☆13 Sep 7, 2023 Updated 2 years ago
- ☆32 Nov 25, 2023 Updated 2 years ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆114 Aug 12, 2023 Updated 2 years ago
- ☆252 Feb 14, 2024 Updated 2 years ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations ☆335 Jul 25, 2024 Updated last year
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations". ☆21 Dec 3, 2021 Updated 4 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆45 May 18, 2023 Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 Apr 11, 2022 Updated 3 years ago
- Code of the paper "Byte Pair Encoding for Symbolic Music" (EMNLP 2023). Better and faster music generation ☆44 Mar 6, 2024 Updated 2 years ago
- The latent diffusion model for text-to-music generation. ☆185 Jan 26, 2024 Updated 2 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi… ☆15 Oct 13, 2022 Updated 3 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning" ☆15 Jun 22, 2023 Updated 2 years ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA. ☆120 Dec 13, 2021 Updated 4 years ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In… ☆41 Aug 12, 2022 Updated 3 years ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025] ☆231 May 11, 2025 Updated 10 months ago
- Official Repository of Six Dragons Fly Again (ISMIR 2024) ☆13 Nov 13, 2025 Updated 4 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 Aug 16, 2023 Updated 2 years ago
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks" ☆20 Aug 18, 2025 Updated 7 months ago
- Codebase for ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆36 Feb 10, 2026 Updated last month
- ☆18 Jan 20, 2025 Updated last year
- ISMIR 24 Supplementary Material ☆14 Oct 28, 2024 Updated last year
- music generation with masked transformers! ☆351 May 16, 2025 Updated 10 months ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24] ☆43 Oct 7, 2024 Updated last year
- ☆31 Mar 19, 2025 Updated last year
- Unofficial download repository for MusicCaps ☆47 Apr 21, 2023 Updated 2 years ago