arunmallya / simple-vqaLinks

Implements an MLP for VQA

☆7

Alternatives and similar repositories for simple-vqa

Users that are interested in simple-vqa are comparing it to the libraries listed below

Sorting:

VisionLearningGroup / Ask_Attend_and_Answer
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
☆25Updated 4 years ago
eriche2016 / image_caption_with_semantic_attenion
image caption with semantic attention
☆11Updated 8 years ago
kevjshih / wtl_vqa
Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)
☆10Updated 5 years ago
lichengunc / vist_api
Visual Storytelling API
☆36Updated 8 years ago
chingyaoc / san-torch
Torch implementation for Stacked Attention Networks
☆23Updated 8 years ago
JonghwanMun / TextguidedATT
The implementation of Text-guided Attention Model for Image Captioning
☆21Updated 7 years ago
rlebret / phrase-based_image_captioning
Torch implementation of ICML 2015 paper about image captioning
☆9Updated 9 years ago
varun-nagaraja / referring-expressions
Localize objects in images using referring expressions
☆37Updated 8 years ago
imatge-upc / vqa-2016-cvprw
Visual question answering for CVPR16 VQA Challenge.
☆41Updated 8 years ago
shtechair / vqa-sva
Structured Attentions for Visual Question Answering
☆46Updated 7 years ago
aylai / DenotationGraph
Generate a denotation graph from a set of image captions
☆15Updated 6 years ago
ffmpbgrnn / VideoQA
Project Uncovering Temporal Context for Video Question and Answering
☆14Updated 9 years ago
jacobandreas / pragma
Reasoning about pragmatics with neural listeners and speakers
☆22Updated 9 years ago
andrewliao11 / Natural-Language-Object-Retrieval-tensorflow
Implement Natural Language Object Retrieval in tensorflow
☆35Updated 8 years ago
LuoweiZhou / e2e-gLSTM-sc
Code for paper "Image Caption Generation with Text-Conditional Semantic Attention"
☆60Updated 7 years ago
jnhwkim / MulLowBiVQA
Hadamard Product for Low-rank Bilinear Pooling
☆70Updated 7 years ago
ronghanghu / cmn
Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017
☆67Updated 6 years ago
spandanagella / verse
Visual Verb Sense Disambiguation
☆13Updated 6 years ago
arijitray1993 / VQARelevance
Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
☆14Updated 6 years ago
rasoolfa / videocap
Memory-augmented Attention Modelling for Videos
☆9Updated 8 years ago
jnhwkim / nips-mrn-vqa
Multimodal Residual Learning for Visual QA (NIPS 2016)
☆38Updated 8 years ago
idansc / HighOrderAtten
☆15Updated 7 years ago
ruotianluo / refexp-comprehension
Referring expression comprehension on ReferIt(RefClef)
☆10Updated 8 years ago
peteanderson80 / coco-caption
Adds SPICE metric to coco-caption evaluation server codes
☆50Updated 2 years ago
vsubhashini / caption-eval
Sentence/Caption evaluation using automated metrics
☆61Updated 9 years ago
uwnlp / verb-attributes
Contains code for the EMNLP paper `Learning Linguistic Attributes for Zero-Shot Verb Classification'
☆26Updated 7 years ago
ronghanghu / snmn
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018
☆71Updated 5 years ago
lupantech / dual-mfa-vqa
Co-attending Regions and Detections for VQA.
☆40Updated 7 years ago
evanmiltenburg / Flickr30k-Image-Viewer
Small Flask-based apps to browse the Flickr30k dataset.
☆20Updated 8 years ago
deshraj / VQA-Chatbot
A Chatbot based on VQA (Visual Question Answering)
☆17Updated 8 years ago