pjlab-songcomposer / songcomposer
☆187Updated 2 months ago
Alternatives and similar repositories for songcomposer:
Users that are interested in songcomposer are comparing it to the libraries listed below
- MU-LLaMA: Music Understanding Large Language Model☆257Updated 10 months ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆168Updated 10 months ago
- The latent diffusion model for text-to-music generation.☆165Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆231Updated last month
- official code for CVPR'24 paper Diff-BGM☆54Updated 3 months ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆298Updated 9 months ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆71Updated 10 months ago
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models☆172Updated 8 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆140Updated 8 months ago
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆154Updated 6 months ago
- ☆40Updated last month
- ☆69Updated 3 months ago
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆339Updated this week
- ☆231Updated 9 months ago
- ☆52Updated 6 months ago
- Mustango: Toward Controllable Text-to-Music Generation☆350Updated 6 months ago
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".☆128Updated 3 weeks ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆96Updated this week
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆159Updated 6 months ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆80Updated last year
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls☆79Updated 6 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆126Updated 6 months ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆184Updated 2 months ago
- ☆149Updated 3 weeks ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆91Updated 3 months ago
- ☆34Updated 9 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆44Updated 4 months ago
- Long-Term Rhythmic Video Soundtracker, ICML2023☆55Updated 6 months ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆335Updated 9 months ago
- The Open Source Code of UniAudio☆540Updated 6 months ago