MarcBS / TMA
Egocentric Video Description based on Temporally-Linked Sequences
☆11Updated 7 years ago
Alternatives and similar repositories for TMA:
Users who are interested in TMA are comparing it to the libraries listed below
- image caption with semantic attention☆12Updated 7 years ago
- Visual Verb Sense Disambiguation☆13Updated 5 years ago
- a list of recent papers on transfer learning☆24Updated 7 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- Memory-augmented Attention Modelling for Videos☆10Updated 7 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Updated 4 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆15Updated 8 years ago
- The implementation of Text-guided Attention Model for Image Captioning☆22Updated 7 years ago
- Implements an MLP for VQA☆8Updated 8 years ago
- ☆19Updated 5 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆9Updated 8 years ago
- Web demo:☆8Updated 5 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning and triplet attention, are very general and can be…☆20Updated 6 years ago
- Multi-Target Embodied Question Answering☆11Updated 5 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆15Updated 6 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- Torch implementation for Stacked Attention Networks☆24Updated 8 years ago
- ☆18Updated 8 years ago
- A Layered Memory Network for MovieQA☆16Updated 6 years ago
- Stacked attention network for answering open-ended questions about image☆12Updated 6 years ago
- Contains code for the EMNLP paper 'Learning Linguistic Attributes for Zero-Shot Verb Classification'☆27Updated 6 years ago
- BISON: Binary Image SelectiON☆49Updated 3 years ago
- https://arxiv.org/abs/1707.00836☆22Updated 7 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- Github Code repo for the ICCV17 paper: 'Learning from Video and Text via Large-Scale Discriminative Clustering'☆9Updated 7 years ago
- Modular and Simple approach to VQA in Keras☆22Updated 7 years ago