gujiuxiang / Visual_Question_Answering.pytorch

☆26

Related projects:

ruotianluo / refexp-comprehension
Referring expression comprehension on ReferIt(RefClef)
☆10 Updated 7 years ago
shtechair / vqa-sva
Structured Attentions for Visual Question Answering
☆47 Updated 6 years ago
JonghwanMun / TextguidedATT
The implementation of Text-guided Attention Model for Image Captioning
☆22 Updated 6 years ago
rasoolfa / videocap
Memory-augmented Attention Modelling for Videos
☆10 Updated 7 years ago
ffmpbgrnn / VideoQA
Project Uncovering Temporal Context for Video Question and Answering
☆15 Updated 8 years ago
eriche2016 / image_caption_with_semantic_attenion
image caption with semantic attention
☆12 Updated 7 years ago
arunmallya / simple-vqa
Implements an MLP for VQA
☆8 Updated 7 years ago
chingyaoc / san-torch
Torch implementation for Stacked Attention Networks
☆24 Updated 7 years ago
kevjshih / wtl_vqa
Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)
☆10 Updated 4 years ago
tgGuo15 / PriorImageCaption
☆29 Updated 5 years ago
jz462 / ContrastiveLosses4VRD
Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing"
☆12 Updated 4 years ago
gnouhp / PyTorch-AdaHAN
An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…
☆53 Updated 6 years ago
ronghanghu / cmn
Code release for Hu et al., "Modeling Relationships in Referential Expressions with Compositional Modular Networks", CVPR 2017
☆67 Updated 6 years ago
TheShadow29 / visual-commonsense-pytorch
For visual commonsense model
☆34 Updated 5 years ago
arijitray1993 / VQARelevance
Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions"
☆15 Updated 6 years ago
idansc / HighOrderAtten
☆15 Updated 6 years ago
aylai / DenotationGraph
Generate a denotation graph from a set of image captions
☆15 Updated 6 years ago
ExplorerFreda / VSE-C
[COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.
☆57 Updated 5 years ago
VisionLearningGroup / Ask_Attend_and_Answer
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
☆25 Updated 3 years ago
bowong / Layered-Memory-Network
A Layered Memory Network for MovieQA
☆16 Updated 6 years ago
yiyang92 / vae_captioning
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
☆57 Updated 6 years ago
doubledaibo / clcaption_nips2017
Contrastive Learning for Image Captioning
☆51 Updated 6 years ago
doubledaibo / 2dcaption_eccv2018
Rethinking the Form of Latent States in Image Captioning
☆21 Updated 6 years ago
eric-xw / Zero-Shot-Video-Captioning
☆33 Updated this week
BryanPlummer / cite
Implementation for our paper "Conditional Image-Text Embedding Networks"
☆38 Updated 4 years ago
sungraepark / Adversarial-Dropout
☆32 Updated 5 years ago
ili3p / vqa-soft
Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 V…
☆15 Updated 7 years ago
xiaolonw / CharadesDet
Charades Object Detection Dataset (ICCV 2017)
☆30 Updated 6 years ago
varun-nagaraja / referring-expressions
Localize objects in images using referring expressions
☆37 Updated 7 years ago
Cold-Winter / vqs
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
☆22 Updated 7 years ago