seungheondoh / music-text-representation-pp
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
☆33Updated 5 months ago
Alternatives and similar repositories for music-text-representation-pp:
Users that are interested in music-text-representation-pp are comparing it to the libraries listed below
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- music semantic understanding evaluation benchmark☆25Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated last week
- million song dataset split for extended clean tag & artist-level stratified☆48Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆21Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- ☆43Updated 9 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆33Updated 3 months ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆23Updated 10 months ago
- Official Implementation of Jointist☆33Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆43Updated 5 months ago
- Landing Page for All Things Source Separation☆22Updated 4 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 3 months ago
- ☆79Updated 2 years ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆36Updated this week
- Deep Performer: Score-to-audio music performance synthesis☆43Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆40Updated last month
- list of MIR dataset papers presented at ISMIR 2022☆61Updated 2 years ago
- Project for MIDI to Audio Synthesis☆22Updated 2 years ago
- ☆43Updated last year
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆44Updated 3 weeks ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 6 months ago
- ☆17Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆25Updated 10 months ago
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆23Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆46Updated 6 months ago
- ☆16Updated 4 months ago