shtdbb / MusicTextAlignmentLinks
This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal models in music-text alignment tasks, similar to how visual-LLM align image encodings with textual embeddings.
☆12Updated last year
Alternatives and similar repositories for MusicTextAlignment
Users that are interested in MusicTextAlignment are comparing it to the libraries listed below
Sorting:
- [AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics☆18Updated last year
- ☆18Updated last year
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆70Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆21Updated 6 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆46Updated 3 weeks ago
- ☆12Updated 6 years ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆60Updated 2 months ago
- Official Implementation of "Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music" (ISMIR 2021)☆59Updated 2 years ago
- Music Generative Pretrained Transformer☆27Updated 2 years ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage☆44Updated last month
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Updated last year
- ☆11Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆29Updated last month
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆13Updated 11 months ago
- Rearrange a music recording to match a new duration - Code for "Music Rearrangement Using Hierarchical Segmentation", ICASSP 2023☆44Updated last year
- Official source codes of airsep☆37Updated last year
- PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing☆70Updated 2 months ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression☆19Updated last year
- This is the official implementation of MusER (AAAI'24).☆30Updated 2 months ago
- fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CU…☆22Updated last year
- ☆10Updated 2 years ago
- SOTA kilo-scale MIDI dataset for MIR and Music AI purposes☆59Updated last year
- ☆16Updated 3 months ago
- 🎹 MIDI Generator Piano Roll☆11Updated last month
- SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours☆28Updated 2 months ago
- A python package for high level musical data manipulation and preprocessing, making data ready to be fed to a neural network.☆42Updated 3 years ago
- TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching [HCMIR 2023]☆48Updated last year
- Real-time end-to-end singing voice convertion☆22Updated 9 months ago