wbs2788 / VMB
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging different representations and enhancing generation with RAG.
☆22 · Updated last month
Alternatives and similar repositories for VMB:
Users who are interested in VMB are comparing it to the repositories listed below.
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆34 · Updated 4 months ago
- text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language mode… ☆31 · Updated last week
- Official Repository for The Paper, PianoBART: Symbolic Piano Music Understanding and Generating with Large-Scale Pre-Training ☆16 · Updated 3 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆32 · Updated this week
- Official source codes of airsep ☆35 · Updated 9 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆21 · Updated 4 months ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆23 · Updated 8 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens. ☆12 · Updated 4 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆74 · Updated 3 weeks ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆16 · Updated this week
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆35 · Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing ☆18 · Updated 4 months ago
- Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial … ☆14 · Updated this week
- PyTorch Implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model. ☆10 · Updated 7 months ago
- CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models ☆48 · Updated last week
- Codebase and project page for EDMSound ☆33 · Updated last year
- TS-BSmamba2: A TWO-STAGE BAND-SPLIT MAMBA-2 NETWORK FOR MUSIC SEPARATION ☆45 · Updated 4 months ago
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R… ☆50 · Updated 4 months ago
- Real-time Timbre Remapping with Differentiable DSP. ☆9 · Updated 2 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu… ☆77 · Updated 4 months ago
- Project for MIDI to Audio Synthesis ☆22 · Updated last year
- This repository contains the dataset used to train the neural network model described in the paper "Implicit HRTF Modeling Using Tempora… ☆11 · Updated last year
- Repository for Semi-supervised Synthesizer Sound Matching with Differentiable DSP ☆20 · Updated 2 years ago
- Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024 ☆13 · Updated 7 months ago
- Code for Investigating Personalization Methods in Text to Music Generation ☆36 · Updated 9 months ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper. ☆36 · Updated 6 months ago
- Official source codes of coco-mulla ☆30 · Updated 9 months ago