ldzhangyx / instruct-MusicGen
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".
☆65 · Updated 2 weeks ago
Related projects:
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili… ☆111 · Updated last month
- Official Implementation of EnCLAP (ICASSP 2024) ☆88 · Updated 3 months ago
- Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.… ☆50 · Updated 8 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆26 · Updated last week
- Unofficial download repository for MusicCaps ☆41 · Updated last year
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey". ☆79 · Updated 2 weeks ago
- Audiogen Codec ☆116 · Updated 2 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆81 · Updated last month
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆41 · Updated last week
- Robust Singing Voice Transcription and MIDI Extraction ☆47 · Updated last month
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆111 · Updated last year
- PyTorch Implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model. ☆10 · Updated 3 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT ☆36 · Updated 9 months ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1… ☆27 · Updated 8 months ago
- Official source codes of airsep ☆33 · Updated 5 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities ☆62 · Updated last month
- million song dataset split for extended clean tag & artist-level stratified ☆46 · Updated last year
- Codebase and project page for EDMSound ☆29 · Updated 9 months ago
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls ☆71 · Updated 2 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings. ☆131 · Updated 8 months ago
- A DDSP-based neural voice synthesiser. ☆95 · Updated last week
- Encode and decode audio samples to/from compressed latent representations! ☆119 · Updated last month
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆25 · Updated 4 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆50 · Updated 10 months ago
- PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities. ☆169 · Updated 3 weeks ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆33 · Updated 10 months ago