PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "
☆40Dec 15, 2020Updated 5 years ago
Alternatives and similar repositories for Foley-Music
Users that are interested in Foley-Music are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Apr 30, 2025Updated 10 months ago
- code for "When Counterpoints Meet Chinese Folk Melody"☆11Feb 19, 2021Updated 5 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 3 years ago
- IMEMNet Dataset☆19Nov 3, 2020Updated 5 years ago
- ☆22Mar 20, 2024Updated 2 years ago
- ☆47Feb 10, 2021Updated 5 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- Official PyTorch implementation of the paper "A Brand New Dance Partner:Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dan…☆37Jul 6, 2022Updated 3 years ago
- [AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models☆27Dec 14, 2023Updated 2 years ago
- Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020☆32Nov 6, 2020Updated 5 years ago
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018☆20May 31, 2022Updated 3 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆54Dec 15, 2020Updated 5 years ago
- ☆17Nov 26, 2020Updated 5 years ago
- [NeurIPS 2024] Code, Dataset, Samples for the VATT paper “ Tell What You Hear From What You See - Video to Audio Generation Through Text”☆36Jul 24, 2025Updated 8 months ago
- Official implementation for AVGN☆40Mar 24, 2023Updated 3 years ago
- ☆23Feb 19, 2021Updated 5 years ago
- This is pytorch implementation of paper Stable Video Style Transfer Based on Partial Convolution with Depth-Aware Supervision.☆13Aug 5, 2020Updated 5 years ago
- Cross-model active contrastive coding☆22Mar 17, 2021Updated 5 years ago
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆162Apr 5, 2023Updated 2 years ago
- ☆48Jul 10, 2024Updated last year
- multimodal transformer☆75Dec 9, 2021Updated 4 years ago
- Inference code for "StylePeople: A Generative Model of Fullbody Human Avatars" paper. This code is for the part of the paper describing v…☆13Aug 18, 2023Updated 2 years ago
- ☆17Apr 1, 2021Updated 4 years ago
- ☆60Nov 19, 2018Updated 7 years ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Dec 6, 2022Updated 3 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- Code accompanying ISMIR 2020 paper - "Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature M…☆52Nov 3, 2020Updated 5 years ago
- ☆13Oct 3, 2023Updated 2 years ago
- The repository of the paper: Wang et al., Learning interpretable representation for controllable polyphonic music generation, ISMIR 2020.☆44Mar 22, 2024Updated 2 years ago
- JavaScript wrapper for Amen - a replacement for remix.js☆13Sep 28, 2019Updated 6 years ago
- An example raylib python application for viewing animation on the Geno character☆42Sep 16, 2025Updated 6 months ago
- Music Transformer Sequence Generation in Pytorch☆103Feb 6, 2020Updated 6 years ago
- Official implementation for "MOCHA: Real-Time Motion Characterization via Context Matching" [SIGGRAPH Asia 2023]☆25Jul 21, 2025Updated 8 months ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 9 months ago
- This repository holds datasets of polyphonic drum patterns used in the creation of Electronic Dance Music.☆16Dec 19, 2016Updated 9 years ago
- Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"☆17Jul 13, 2025Updated 8 months ago
- ☆19Sep 19, 2024Updated last year
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago