shtdbb / MusicTextAlignmentLinks
This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal models in music-text alignment tasks, similar to how visual-LLM align image encodings with textual embeddings.
☆12Updated last year
Alternatives and similar repositories for MusicTextAlignment
Users that are interested in MusicTextAlignment are comparing it to the libraries listed below
Sorting:
- [AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics☆18Updated last year
- ☆18Updated last year
- Official Implementation of "Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music" (ISMIR 2021)☆59Updated 2 years ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- Music Generative Pretrained Transformer☆27Updated 3 years ago
- 🎹 MIDI Generator Piano Roll☆13Updated last month
- ☆10Updated 2 years ago
- ☆12Updated 6 years ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage☆48Updated 3 months ago
- ☆11Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆75Updated last year
- TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching [HCMIR 2023]☆48Updated 2 years ago
- SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours☆28Updated 4 months ago
- Official source codes of airsep☆38Updated last year
- A python package for high level musical data manipulation and preprocessing, making data ready to be fed to a neural network.☆41Updated 3 years ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆48Updated 2 months ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Updated 8 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆87Updated 9 months ago
- fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CU…☆22Updated last year
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Updated last year
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆20Updated last year
- SOTA kilo-scale MIDI dataset for MIR and Music AI purposes☆60Updated last year
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆61Updated 3 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Updated 4 years ago
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆15Updated last year
- ☆11Updated last year
- Full-attention multi-instrumental music transformer featuring asymmetrical encoding with octo-velocity, and chords counters tokens, optim…☆46Updated last year
- Deep Performer: Score-to-audio music performance synthesis☆44Updated 2 years ago
- Algorithms to automatically recognize guitar effects and retrieve their parameters for timbre reproduction☆25Updated 3 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆34Updated last month