Content-Based Video-Music Retrieval using Soft Intra-Modal Structure Constraint
☆62 Sep 22, 2017 Updated 8 years ago
Alternatives and similar repositories for VM-NET
Users that are interested in VM-NET are comparing it to the libraries listed below.
- Starter code for working with the YouTube-8M dataset. ☆16 Jun 9, 2017 Updated 9 years ago
- Cross-modality (visual-auditory) Metric Learning Project ☆15 Dec 19, 2017 Updated 8 years ago
- ☆13 Aug 21, 2022 Updated 3 years ago
- "Generating Music Medleys via Music Puzzle Games", AAAI 2018 ☆19 Nov 6, 2018 Updated 7 years ago
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018 ☆20 May 31, 2022 Updated 4 years ago
- ☆58 Nov 2, 2020 Updated 5 years ago
- Keras Implementation of "Look, Listen and Learn" Model ☆21 Nov 14, 2017 Updated 8 years ago
- SVCNet: Scribble-based Video Colorization Network with Temporal Aggregation. IEEE TIP, 2023 ☆17 Jul 21, 2025 Updated 10 months ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im… ☆18 Jun 7, 2018 Updated 8 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18) ☆32 Jun 22, 2018 Updated 7 years ago
- YouTube video recommendation (generation 4) ☆21 Oct 16, 2019 Updated 6 years ago
- [ECCV2022] D2M-GAN for music generation from dance videos ☆85 Aug 16, 2022 Updated 3 years ago
- ☆260 Dec 10, 2022 Updated 3 years ago
- Unofficial Implementation of Google DeepMind's paper `Objects that Sound` ☆83 May 7, 2018 Updated 8 years ago
- LaTeX template of NCKU Thesis ☆11 Nov 24, 2014 Updated 11 years ago
- experiments on classifying actions using poses ☆26 May 5, 2018 Updated 8 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 Aug 16, 2023 Updated 2 years ago
- Learning from Limited and Imperfect Data (L2ID): Classification Challenges ☆17 Feb 7, 2021 Updated 5 years ago
- ☆10 Nov 6, 2017 Updated 8 years ago
- The code for shuttleNet. ☆31 Jul 25, 2017 Updated 8 years ago
- Theano implementation of Sequence-to-Sequence Autoencoder ☆13 Jun 1, 2018 Updated 8 years ago
- Notes and slides for a course on social and scientific aspects of machine learning ☆10 Updated this week
- Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks ☆69 Oct 9, 2020 Updated 5 years ago
- Learn an L3 embedding from audio/video pairs ☆89 Apr 24, 2022 Updated 4 years ago
- SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music. ☆11 Nov 15, 2025 Updated 7 months ago
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images ☆30 Jun 14, 2019 Updated 7 years ago
- Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval (CVPR2018) ☆164 Jul 20, 2018 Updated 7 years ago
- ☆15 Sep 26, 2022 Updated 3 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018 ☆209 Apr 3, 2021 Updated 5 years ago
- ☆91 Aug 29, 2018 Updated 7 years ago
- A UWP client for the ASRT speech recognition system. ☆12 Oct 23, 2019 Updated 6 years ago
- GBDF: Gender Balanced DeepFake Dataset ☆11 Jul 22, 2022 Updated 3 years ago
- PyTorch implementation of the paper [CVPR2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning ☆89 Jul 7, 2021 Updated 4 years ago
- an end-to-end instance-segmentation framework inspired by YOLO and mask R-CNN ☆13 Nov 22, 2019 Updated 6 years ago
- ☆31 Feb 4, 2021 Updated 5 years ago
- The implementation of the PDANet ☆41 Sep 11, 2019 Updated 6 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model ☆11 Jan 3, 2018 Updated 8 years ago
- [CVPR 2026] OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models ☆91 Apr 20, 2026 Updated last month
- CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer ☆57 Jan 2, 2022 Updated 4 years ago