wzk1015 / video-bgm-generationLinks
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
☆318Updated 2 months ago
Alternatives and similar repositories for video-bgm-generation
Users that are interested in video-bgm-generation are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP☆354Updated 3 years ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆77Updated last year
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆365Updated last year
- The latent diffusion model for text-to-music generation.☆174Updated last year
- Official implementation of compound word transformer (AAAI'21)☆278Updated last year
- [ECCV2022] D2M-GAN for music generation from dance videos☆86Updated 3 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆53Updated 4 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "☆40Updated 4 years ago
- Symbolic Music Generation with Diffusion Models☆261Updated this week
- IMEMNet Dataset☆19Updated 4 years ago
- Symphony Generation with Permutation Invariant Language Model☆256Updated 2 years ago
- Emotional conditioned music generation using transformer-based model.☆159Updated 2 years ago
- PyTorch implementation of MuseMorphose (published at IEEE/ACM TASLP), a Transformer-based model for music style transfer.☆186Updated 2 years ago
- MU-LLaMA: Music Understanding Large Language Model☆285Updated last week
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆59Updated 3 weeks ago
- ☆55Updated 8 months ago
- ☆11Updated 3 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆113Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference.☆272Updated 8 months ago
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018☆19Updated 3 years ago
- This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.☆197Updated last year
- [CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation☆440Updated last year
- MIDI, WAV domain music emotion recognition [ISMIR 2021]☆83Updated 3 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆88Updated last year
- Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code☆443Updated last year
- Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)☆149Updated last year
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆337Updated last year
- Mustango: Toward Controllable Text-to-Music Generation☆373Updated 2 months ago
- VGGSound: A Large-scale Audio-Visual Dataset☆326Updated 3 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆51Updated last year