aimagelab / mvad-names-datasetLinks
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆22Updated 6 years ago
Alternatives and similar repositories for mvad-names-dataset
Users that are interested in mvad-names-dataset are comparing it to the libraries listed below
Sorting:
- Mixture-of-Embeddings-Experts☆120Updated 5 years ago
- ☆48Updated 5 years ago
- Dense video captioning in PyTorch☆41Updated 6 years ago
- A dataset with user created GIFs☆49Updated 7 years ago
- Unofficial sample code for Distilled 3D Networks (D3D) in Tensorflow.☆49Updated 6 years ago
- A PyTorch implementation of VSumPtrGAN☆39Updated 2 years ago
- A dataset with user created GIFs☆65Updated 7 years ago
- Code and demos for our paper at ACM MM 2017☆62Updated 6 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated 2 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 8 years ago
- ☆35Updated 6 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆83Updated 6 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆130Updated 4 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆52Updated 5 years ago
- AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos. ECCV'18.☆78Updated 3 years ago
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017☆17Updated 8 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated 2 years ago
- Action Proposals generated by deep models☆29Updated 8 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Updated 6 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆47Updated last year
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Updated 5 years ago
- A Dataset for Grounded Video Description☆163Updated 3 years ago
- The Holistic Video Understanding Mini Dataset☆34Updated 5 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Updated 5 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- PyTorch implementation of Video Summarization on Twitch (LOL) dataset☆38Updated 7 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Updated 3 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Updated 5 years ago