elianakim / Amuse
Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations
☆15 Updated 8 months ago
Alternatives and similar repositories for Amuse
Users that are interested in Amuse are comparing it to the libraries listed below
- ☆36 Updated 5 months ago
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD). ☆162 Updated 2 years ago
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025) ☆31 Updated 2 weeks ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆43 Updated 9 months ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation ☆76 Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 Updated 2 years ago
- Official PyTorch Implementation ☆18 Updated 2 years ago
- code for A Large-scale Dataset for Audio-Language Representation Learning ☆14 Updated last year
- ☆58 Updated 11 months ago
- Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral). ☆84 Updated last year
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To… ☆10 Updated 3 years ago
- ☆46 Updated last year
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022) ☆20 Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆86 Updated last year
- Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy M… ☆36 Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models" ☆57 Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) ☆15 Updated 2 years ago
- [NeurIPS'22] Official code of "ComMU: Dataset for Combinatorial Music Generation" ☆140 Updated 2 years ago
- [ICLR'23] New Insights for the Stability-Plasticity Dilemma in Online Continual Learning ☆20 Updated 2 years ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation ☆23 Updated 2 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆44 Updated last year
- Unofficial download repository for MusicCaps ☆47 Updated 2 years ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024 ☆33 Updated 7 months ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80 Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation ☆12 Updated 4 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022) ☆20 Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image… ☆61 Updated 2 years ago
- ☆17 Updated last year
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation ☆119 Updated 2 years ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆45 Updated last year