elianakim / AmuseLinks
Official Implementation of Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations
☆16Updated 9 months ago
Alternatives and similar repositories for Amuse
Users that are interested in Amuse are comparing it to the libraries listed below
Sorting:
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆162Updated 2 years ago
- ☆36Updated 6 months ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Updated 10 months ago
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆31Updated last month
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Updated 2 years ago
- ☆58Updated last year
- Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy M…☆36Updated last year
- Official PyTorch Implementation☆18Updated 2 years ago
- [ICCV 2023] Online Clustered Codebook☆176Updated last year
- Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).☆84Updated last year
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆23Updated 2 years ago
- ☆47Updated last year
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…☆10Updated 3 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆85Updated last year
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated 2 years ago
- Official implementation for AVGN☆37Updated 2 years ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated 2 years ago
- ☆17Updated 2 years ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆76Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆46Updated last year
- ☆35Updated 4 months ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Updated last year
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆30Updated last year
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation☆121Updated 2 years ago
- Contrastively Disentangled Sequential Variational Audoencoder☆48Updated last year
- ☆12Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆13Updated 5 months ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆18Updated 10 months ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆19Updated last year
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Updated 2 years ago