yukezhu/visual7w-toolkit

Toolkit for Visual7W visual question answering dataset

☆80

Alternatives and similar repositories for visual7w-toolkit

Users that are interested in visual7w-toolkit are comparing it to the libraries listed below.

yukezhu / visual7w-qa-models
Visual7W visual question answering models
☆65 · Oct 8, 2019 · Updated 6 years ago
jnhwkim / nips-mrn-vqa
Multimodal Residual Learning for Visual QA (NIPS 2016)
☆39 · Dec 27, 2016 · Updated 9 years ago
zhoubolei / VQAbaseline
Simple Baseline for Visual Question Answering
☆186 · Dec 21, 2016 · Updated 9 years ago
shtechair / vqa-sva
Structured Attentions for Visual Question Answering
☆46 · Mar 4, 2018 · Updated 8 years ago
wangzheallen / STL-VQA
Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…
☆19 · Jan 23, 2018 · Updated 8 years ago
asanakoy / cliquecnn
Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792
☆22 · Nov 10, 2017 · Updated 8 years ago
GT-Vision-Lab / VQA_LSTM_CNN
Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…
☆386 · Mar 22, 2019 · Updated 7 years ago
abhshkdz / neural-vqa-attention
Attention-based Visual Question Answering in Torch
☆101 · Aug 13, 2017 · Updated 8 years ago
jiasenlu / HieCoAttenVQA
☆351 · Oct 2, 2018 · Updated 7 years ago
imatge-upc / vqa-2016-cvprw
Visual question answering for CVPR16 VQA Challenge.
☆41 · Nov 5, 2016 · Updated 9 years ago
AishwaryaAgrawal / GVQA
Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …
☆27 · Mar 10, 2022 · Updated 4 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150 · Mar 11, 2019 · Updated 7 years ago
ronghanghu / cmn
Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017
☆67 · Sep 20, 2018 · Updated 7 years ago
zcyang / imageqa-san
Code for Stacked attention networks for image question answering
☆108 · Jan 7, 2017 · Updated 9 years ago
GT-Vision-Lab / VQA
☆392 · Mar 11, 2021 · Updated 5 years ago
jnhwkim / cbp
Multimodal Compact Bilinear Pooling for Torch7
☆70 · Jan 2, 2017 · Updated 9 years ago
DeepRNN / visual_question_answering
Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"
☆79 · Mar 22, 2018 · Updated 8 years ago
lmelvix / visual-question-answering-tensorflow
Stacked attention network for answering open-ended questions about image
☆12 · May 31, 2018 · Updated 8 years ago
sidgan / whats_in_a_question
CVPR'17 Spotlight: What’s in a Question: Using Visual Questions as a Form of Supervision
☆44 · Aug 31, 2018 · Updated 7 years ago
maifoundations / GCoT
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
☆15 · Aug 11, 2025 · Updated 11 months ago
jnhwkim / MulLowBiVQA
Hadamard Product for Low-rank Bilinear Pooling
☆72 · Nov 6, 2017 · Updated 8 years ago
sominw / vqamd_floyd
Visual Question Answering through modal dialogue + API
☆15 · Dec 8, 2022 · Updated 3 years ago
MarcBS / VIBIKNet
Visual Bidirectional Kernelized Network for Visual Question Answering
☆11 · Jul 17, 2017 · Updated 9 years ago
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆50 · Feb 18, 2020 · Updated 6 years ago
s-gupta / visual-concepts
Code for detecting visual concepts in images.
☆150 · Feb 27, 2018 · Updated 8 years ago
ranjaykrishna / visual_genome_python_driver
A Python wrapper for the Visual Genome API
☆371 · Sep 21, 2023 · Updated 2 years ago
spro / torch-seq2seq-attention
Torch implementation of seq2seq machine translation with GRU RNN and attention
☆76 · Dec 4, 2016 · Updated 9 years ago
iamaaditya / VQA_Keras
Modular and Simple approach to VQA in Keras
☆21 · Sep 6, 2017 · Updated 8 years ago
zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21 · Oct 20, 2020 · Updated 5 years ago
gidariss / AttractioNet
Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
☆62 · Feb 12, 2019 · Updated 7 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38 · Nov 22, 2022 · Updated 3 years ago
mpezeshki / Associative_LSTM
LSTM with associative memory cells (http://arxiv.org/abs/1602.03032)
☆111 · May 1, 2016 · Updated 10 years ago
lichengunc / speaker_listener_reinforcer
Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension
☆34 · Mar 8, 2018 · Updated 8 years ago
castorini / VDPWI-NN-Torch
Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)
☆18 · May 27, 2018 · Updated 8 years ago
jingchenchen / ReasoningConsistency-VQA
☆13 · Aug 14, 2022 · Updated 3 years ago
iassael / torch-policy-gradient
Deterministic Policy Gradient using torch7
☆43 · Jun 2, 2016 · Updated 10 years ago
hexiang-hu / answer_embedding
Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)
☆13 · Apr 6, 2019 · Updated 7 years ago
eladhoffer / captionGeneration.torch
Generate captions for an image using convolutional and recurrent networks
☆12 · Feb 25, 2016 · Updated 10 years ago