shtdbb / MusicTextAlignment

This dataset aligns piano music in MIDI format with corresponding textual descriptions and comments. It can be used to train multi-modal models on music-text alignment tasks, similar to how visual LLMs align image encodings with text embeddings.

Related projects: