wbs2788 / MTMLinks
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging different representations and enhancing generation with RAG.
☆28Updated last year
Alternatives and similar repositories for MTM
Users that are interested in MTM are comparing it to the libraries listed below
Sorting:
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆48Updated last year
- ☆29Updated 3 months ago
- official code for CVPR'24 paper Diff-BGM☆71Updated last year
- XMIDI Dataset: A large-scale symbolic music dataset with emotion and genre labels.☆32Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆28Updated last year
- ☆54Updated last year
- ☆50Updated last year
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆61Updated 6 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated last month
- Official source codes of airsep☆39Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Updated 3 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆105Updated last month
- ☆19Updated 9 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- ☆124Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Updated 11 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Updated 11 months ago
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R…☆60Updated last year
- ☆32Updated last month
- ☆58Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆27Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Updated 4 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Updated last year
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆43Updated 11 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆51Updated 6 months ago
- The implementation of "Systematic Analysis of Music Representations from BERT"☆27Updated 2 years ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆58Updated 3 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Updated last year
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆45Updated last year
- MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]☆51Updated last month