elianakim / AmuseLinks
Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations
☆19Updated last year
Alternatives and similar repositories for Amuse
Users that are interested in Amuse are comparing it to the libraries listed below
Sorting:
- ☆40Updated 9 months ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆17Updated last year
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Updated last year
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆162Updated 2 years ago
- ☆17Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Updated 2 years ago
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆40Updated 4 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Updated last year
- ☆59Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆43Updated last year
- Official implementation for AVGN☆40Updated 2 years ago
- ☆37Updated 6 months ago
- [ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.☆118Updated 5 months ago
- Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).☆86Updated last year
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Updated 2 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆63Updated 2 years ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆17Updated 4 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆88Updated last year
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)☆26Updated 7 months ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆27Updated 3 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆23Updated 2 years ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Updated 3 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆51Updated last year
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆79Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 3 years ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Updated last year
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆25Updated last year
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆47Updated last year
- Official PyTorch Implementation☆17Updated 3 years ago
- [ICCV 2023] Online Clustered Codebook☆181Updated last year