Content-Based Video-Music Retrieval using Soft Intra-Modal Structure Constraint
☆62Sep 22, 2017Updated 8 years ago
Alternatives and similar repositories for VM-NET
Users that are interested in VM-NET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- "Generating Music Medleys via Music Puzzle Games", AAAI 2018☆19Nov 6, 2018Updated 7 years ago
- ☆11Apr 30, 2025Updated last year
- ☆58Nov 2, 2020Updated 5 years ago
- A python module for generating photo mosaic videos from mp4s with a variety of features for color and granularity filtering☆10Dec 23, 2015Updated 10 years ago
- SVCNet: Scribble-based Video Colorization Network with Temporal Aggregation. IEEE TIP, 2023☆17Jul 21, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- Codes and data for 《De-biased Court’s View Generation with Causality》 EMNLP 2020☆14Nov 29, 2021Updated 4 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Jun 22, 2018Updated 7 years ago
- Course Project for CS771: Machine Learning☆25Feb 19, 2017Updated 9 years ago
- youtube video recommendation(generation 4)☆21Oct 16, 2019Updated 6 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆106Aug 1, 2023Updated 2 years ago
- [ECCV2022] D2M-GAN for music generation from dance videos☆85Aug 16, 2022Updated 3 years ago
- ☆259Dec 10, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83May 7, 2018Updated 8 years ago
- Video retrieval from query images☆11Oct 10, 2017Updated 8 years ago
- Pytorch implementation of 'See, Hear, and Read: Deep Aligned Representations'☆33Dec 17, 2018Updated 7 years ago
- LaTeX template of NCKU Thesis☆11Nov 24, 2014Updated 11 years ago
- experiments on classifying actions using poses☆26May 5, 2018Updated 8 years ago
- Code repository for GCT634 Musical Applications of Machine Learning (Spring 2024)☆11May 19, 2024Updated last year
- Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks☆69Oct 9, 2020Updated 5 years ago
- Learn and L3 embedding from audio/video pairs☆89Apr 24, 2022Updated 4 years ago
- ☆30Feb 4, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Thesis template for IMSLab, NCKU CSIE.☆12Aug 16, 2019Updated 6 years ago
- ☆91Aug 29, 2018Updated 7 years ago
- Aquila is a digital signal processing library for C++11.☆15Nov 14, 2022Updated 3 years ago
- ☆17May 11, 2022Updated 3 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- Object detection and classification☆12Oct 19, 2018Updated 7 years ago
- GBDF: Gender Balanced DeepFake Dataset☆11Jul 22, 2022Updated 3 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- [CVPR 2026] OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆78Apr 20, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- an end-to-end instance-segmentation framework inspired by YOLO and mask R-CNN☆13Nov 22, 2019Updated 6 years ago
- The implementation of the PDANet☆41Sep 11, 2019Updated 6 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer☆57Jan 2, 2022Updated 4 years ago
- A python version of Spatiotemporal Multiplier Networks based on mxnet.☆10Jan 2, 2018Updated 8 years ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago