elianakim / Amuse
Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations
☆14 · Updated 5 months ago
Alternatives and similar repositories for Amuse
Users who are interested in Amuse compare it to the repositories listed below.
- Official PyTorch Implementation ☆18 · Updated 2 years ago
- ☆35 · Updated 2 months ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight ☆46 · Updated 2 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆11 · Updated 2 years ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022) ☆17 · Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) ☆15 · Updated 2 years ago
- ☆16 · Updated last year
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD). ☆160 · Updated 2 years ago
- [ICLR'23] New Insights for the Stability-Plasticity Dilemma in Online Continual Learning ☆20 · Updated 2 years ago
- Official implementation for AVGN ☆35 · Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆84 · Updated last year
- ☆33 · Updated 4 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning" ☆37 · Updated 2 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆42 · Updated 6 months ago
- ☆54 · Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models" ☆56 · Updated 2 years ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion" ☆66 · Updated last year
- Official Implementation (PyTorch) of "Constant Acceleration Flow", NeurIPS 2024 ☆33 · Updated 4 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆42 · Updated 9 months ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021) ☆14 · Updated 3 years ago
- A PyTorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To… ☆10 · Updated 3 years ago
- Code for the paper "Multi-scale Diffusion Denoised Smoothing" (NeurIPS 2023) ☆14 · Updated last year
- This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023) ☆23 · Updated last year
- An official PyTorch implementation of the AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆42 · Updated last year
- Sound-guided Semantic Image Manipulation - Official PyTorch Code (CVPR 2022) ☆80 · Updated last year
- ☆54 · Updated 8 months ago
- Learning Large-scale Neural Fields via Context Pruned Meta-Learning (NeurIPS 2023) ☆26 · Updated last year
- ☆20 · Updated last year
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer ☆58 · Updated 5 years ago
- ☆40 · Updated last year