zongshenmu/attention_knowledge_vqa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zongshenmu/attention_knowledge_vqa)

zongshenmu / attention_knowledge_vqa

vqa drived by bottom-up and top-down attention and knowledge

☆14

Alternatives and similar repositories for attention_knowledge_vqa

Users that are interested in attention_knowledge_vqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
Wentong-DST / up-down-captioner
View on GitHub
Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
☆29Oct 24, 2018Updated 7 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
wangzheallen / STL-VQA
View on GitHub
The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…
☆19Jan 23, 2018Updated 8 years ago
HLR / Cross_Modality_Relevance
View on GitHub
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27May 6, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
chrisc36 / debias
View on GitHub
Methods of training NLP models to ignored biased strategies
☆55May 22, 2023Updated 3 years ago
SinghJasdeep / Attention-on-Attention-for-VQA
View on GitHub
Visual Question Answering Project with state of the art single Model performance.
☆130Jun 18, 2018Updated 8 years ago
cdancette / rubi.bootstrap.pytorch
View on GitHub
NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering
☆66Mar 29, 2021Updated 5 years ago
XMUVQA / CapsAtt
View on GitHub
Project for Dynamic Capsule Attention
☆12Dec 7, 2019Updated 6 years ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
JunweiLiang / FVTA_MemexQA
View on GitHub
Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
☆33Jul 1, 2019Updated 7 years ago
yangxuntu / catt
View on GitHub
☆12Mar 8, 2021Updated 5 years ago
jialinwu17 / self_critical_vqa
View on GitHub
Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''
☆40Sep 9, 2019Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
crownpku / Chinese-VQA
View on GitHub
Chinese Visual Question Answering 中文看图问答
☆47Sep 16, 2017Updated 8 years ago
chrisc36 / bottom-up-attention-vqa
View on GitHub
BottomUpTopDown VQA model with question-type debiasing
☆22Oct 6, 2019Updated 6 years ago
asdf0982 / vqa-mfb.pytorch
View on GitHub
This project is out of date, I don't remember the details inside...
☆85Dec 2, 2017Updated 8 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 5 years ago
alibabadoufu / dynamic_fusion_reimplementation
View on GitHub
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17Oct 30, 2019Updated 6 years ago
MILVLG / mcan-vqa
View on GitHub
Deep Modular Co-Attention Networks for Visual Question Answering
☆459Dec 16, 2020Updated 5 years ago
linjieli222 / VQA_ReGAT
View on GitHub
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187Apr 15, 2021Updated 5 years ago
hengyuan-hu / bottom-up-attention-vqa
View on GitHub
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768Mar 10, 2024Updated 2 years ago
YulongBonjour / BrainCLIP
View on GitHub
Coming soon~
☆14Jul 15, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
erobic / negative_analysis_of_grounding
View on GitHub
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Jun 26, 2020Updated 6 years ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
Deanplayerljx / tab-vcr
View on GitHub
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19May 6, 2021Updated 5 years ago
chhwang / cmcl
View on GitHub
This code is for the paper "Confident Multiple Choice Learning".
☆17Aug 4, 2018Updated 7 years ago
DerekDLP / VQA-papers
View on GitHub
A list of recent papers regarding visual(image) question answering「mainly from arxiv.com」
☆16Mar 6, 2019Updated 7 years ago
BonnieHuangxin / SLTA
View on GitHub
ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》
☆36Jun 19, 2019Updated 7 years ago
JonghwanMun / MarioQA
View on GitHub
Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017
☆10Oct 28, 2025Updated 9 months ago
goodskillprogramer / SentenceSimilarity
View on GitHub
☆16May 9, 2018Updated 8 years ago
VirajBagal / MMBERT
View on GitHub
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
☆39Mar 22, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
runzhouge / MAC
View on GitHub
MAC: Mining Activity Concepts for Language-based Temporal Localization
☆36Nov 26, 2018Updated 7 years ago
cdancette / detect-shortcuts
View on GitHub
Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
☆29Jul 1, 2024Updated 2 years ago
acambray / GroundeR-PyTorch
View on GitHub
This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch
☆18Apr 7, 2020Updated 6 years ago
MILVLG / openvqa
View on GitHub
A lightweight, scalable, and general framework for visual question answering research
☆334Sep 3, 2021Updated 4 years ago
yuzcccc / vqa-mfb
View on GitHub
☆184Jul 30, 2019Updated 6 years ago
UCSB-AI / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
dice-group / TeBaQA
View on GitHub
A question answering system which utilises machine learning.
☆21Oct 30, 2023Updated 2 years ago