elianakim / AmuseLinks
Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations
☆19Updated 11 months ago
Alternatives and similar repositories for Amuse
Users that are interested in Amuse are comparing it to the libraries listed below
Sorting:
- ☆40Updated 8 months ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆17Updated 11 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆12Updated last year
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆39Updated 3 months ago
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆162Updated 2 years ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆77Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Updated 2 years ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆17Updated 4 years ago
- ☆47Updated last year
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆25Updated 2 months ago
- Official implementation for AVGN☆39Updated 2 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Updated last year
- ☆58Updated last year
- [CVPR 2025] Pytorch implementation of the paper "Hearing Anywhere in Any Environment"☆25Updated 3 months ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Updated 3 years ago
- ☆17Updated 2 years ago
- Official PyTorch Implementation☆18Updated 3 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆51Updated last year
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆16Updated 2 months ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Updated 8 months ago
- ☆30Updated 3 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆63Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Updated 2 years ago
- ☆19Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆13Updated 9 months ago
- ☆34Updated 5 months ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆24Updated 2 years ago
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)☆25Updated 6 months ago
- official code for CVPR'24 paper Diff-BGM☆72Updated last year
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆47Updated last year