MarcBS / TMA
Egocentric Video Description based on Temporally-Linked Sequences
☆11Updated 7 years ago
Alternatives and similar repositories for TMA:
Users that are interested in TMA are comparing it to the libraries listed below
- Contains code for the EMNLP paper "Learning Linguistic Attributes for Zero-Shot Verb Classification"☆26Updated 6 years ago
- Referring expression comprehension on ReferIt(RefClef)☆9Updated 8 years ago
- image caption with semantic attention☆11Updated 7 years ago
- The implementation of Text-guided Attention Model for Image Captioning☆21Updated 7 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆8Updated 8 years ago
- a list of recent papers on transfer learning☆24Updated 7 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆24Updated 4 years ago
- Torch implementation for Stacked Attention Networks☆23Updated 8 years ago
- Implements an MLP for VQA☆7Updated 8 years ago
- Memory-augmented Attention Modelling for Videos☆10Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 8 years ago
- The good practice in the VQA system such as POS-tag attention, structured triplet learning and triplet attention is very general and can be…☆19Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- Visual Verb Sense Disambiguation☆13Updated 5 years ago
- A collection of Deep Learning papers I read, sorted by category.☆9Updated 6 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- BISON: Binary Image SelectiON☆49Updated 3 years ago
- Implement Natural Language Object Retrieval in TensorFlow☆35Updated 8 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆17Updated 7 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Updated 6 years ago
- ☆15Updated 7 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- Multi-Target Embodied Question Answering☆11Updated 5 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15Updated 3 years ago
- Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)☆58Updated 5 years ago
- This repository contains the code for the paper "Few-Shot Learning Through an Information Retrieval Lens". Eleni Triantafillou, Richard Z…☆25Updated 7 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017☆66Updated 6 years ago
- For visual commonsense model☆34Updated 5 years ago