aimagelab / mvad-names-dataset
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆20Updated 5 years ago
Alternatives and similar repositories for mvad-names-dataset
Users who are interested in mvad-names-dataset are comparing it to the repositories listed below.
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated 2 years ago
- Unofficial sample code for Distilled 3D Networks (D3D) in TensorFlow.☆49Updated 6 years ago
- Official TensorFlow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆60Updated 2 years ago
- This repository contains the code for our AAAI 2017 paper, "Learning Latent Sub-events in Activity Videos Using Temporal Attention Filter…☆23Updated 6 years ago
- Rethinking the Form of Latent States in Image Captioning☆21Updated 6 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Updated 4 years ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- Here we describe a new approach to training a video captioning neural network that is not only based on the normal cross-entropy loss for …☆7Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Updated 6 years ago
- The Holistic Video Understanding Mini Dataset☆34Updated 5 years ago
- AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos. ECCV'18.☆76Updated 3 years ago
- TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".☆40Updated 6 years ago
- ☆29Updated 5 years ago
- ☆35Updated 6 years ago
- Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…☆34Updated 6 years ago
- The official code for the NeurIPS 2019 paper: Quanfu Fan, Richard Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Effi…☆53Updated 5 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆32Updated 5 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆26Updated 5 years ago
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017☆17Updated 8 years ago
- Mixture-of-Embeddings-Experts☆119Updated 4 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Updated 4 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆46Updated last year
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- Code for training temporal fully-connected CRF models in Torch☆68Updated 6 years ago
- ☆47Updated 5 years ago
- convenience utilities for model validation☆23Updated 6 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Preprocess the ActivityNet dataset for the detection task☆14Updated 8 years ago