haofanwang / visbeat3
Python 3 implementation of 'Visual Rhythm and Beat' (SIGGRAPH 2018)
☆19 · Updated 3 years ago
Alternatives and similar repositories for visbeat3
Users who are interested in visbeat3 are comparing it to the libraries listed below.
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆40 · Updated 4 years ago
- [ECCV2022] D2M-GAN for music generation from dance videos ☆86 · Updated 2 years ago
- Long-Term Rhythmic Video Soundtracker, ICML2023 ☆59 · Updated 2 weeks ago
- The project page repo for Neural Dubber. ☆30 · Updated last year
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at BMVC 2022) ☆51 · Updated last year
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020) ☆33 · Updated 4 years ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies ☆51 · Updated 2 years ago
- Image Animation with Perturbed Masks ☆12 · Updated 3 years ago
- Audio-driven synthesis of choreographic movements using GANs ☆18 · Updated 5 years ago
- A simple library for extracting representations from Jukebox ☆35 · Updated 2 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021) ☆65 · Updated 4 years ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies". ☆88 · Updated last year
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects ☆64 · Updated 5 months ago
- The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*. ☆55 · Updated 4 years ago
- Starter code for working with the YouTube-8M dataset. ☆16 · Updated 8 years ago
- multimodal transformer ☆74 · Updated 3 years ago
- Talking Head from Speech Audio using a Pre-trained Image Generator ☆23 · Updated last year
- Extracted YouTube 8M URLs and Labels without all the TF Record parsing/features ☆26 · Updated last year
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis ☆39 · Updated last year
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019) ☆57 · Updated 5 years ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication." ☆14 · Updated 2 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph" ☆54 · Updated 2 years ago
- [ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer ☆317 · Updated 2 months ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices ☆67 · Updated last year