shtdbb / MusicTextAlignmentLinks
This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal models in music-text alignment tasks, similar to how visual-LLM align image encodings with textual embeddings.
☆12Updated 2 years ago
Alternatives and similar repositories for MusicTextAlignment
Users that are interested in MusicTextAlignment are comparing it to the libraries listed below
Sorting:
- [AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics☆20Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆49Updated 4 months ago
- ☆18Updated last year
- ☆13Updated 7 years ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆27Updated 2 years ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage☆49Updated 5 months ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Updated 2 years ago
- Official source codes of airsep☆38Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆36Updated last month
- Official Implementation of "Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music" (ISMIR 2021)☆59Updated 2 years ago
- A large-scale dataset of caption-annotated MIDI files.☆75Updated last year
- ☆10Updated 2 years ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆68Updated 5 months ago
- Music Generative Pretrained Transformer☆27Updated 3 years ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆87Updated 11 months ago
- TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching [HCMIR 2023]☆49Updated 2 years ago
- ☆12Updated 2 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated 10 months ago
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆118Updated 3 months ago
- ☆48Updated 2 years ago
- ☆51Updated last year
- Audio-to-Score Alignment Using Deep Automatic Music Transcription☆46Updated 3 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆44Updated 2 years ago
- A python package for high level musical data manipulation and preprocessing, making data ready to be fed to a neural network.☆41Updated 3 years ago
- Real-time end-to-end singing voice convertion☆22Updated last year
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆40Updated last year
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆44Updated last year
- SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours☆28Updated 6 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Updated 4 years ago