asdf0982/vqa-mfb.pytorch

This project is out of date; I don't remember the details inside...

☆85

Alternatives and similar repositories for vqa-mfb.pytorch

Users that are interested in vqa-mfb.pytorch are comparing it to the libraries listed below.

yuzcccc / vqa-mfb
☆184 · Jul 30, 2019 · Updated 6 years ago
MILVLG / openvqa
A lightweight, scalable, and general framework for visual question answering research
☆334 · Sep 3, 2021 · Updated 4 years ago
shtechair / vqa-sva
Structured Attentions for Visual Question Answering
☆46 · Mar 4, 2018 · Updated 8 years ago
hengyuan-hu / bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768 · Mar 10, 2024 · Updated 2 years ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44 · Mar 19, 2023 · Updated 3 years ago
idansc / HighOrderAtten
☆16 · Dec 22, 2017 · Updated 8 years ago
zongshenmu / attention_knowledge_vqa
VQA driven by bottom-up and top-down attention and knowledge
☆14 · Nov 21, 2018 · Updated 7 years ago
MILVLG / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆459 · Dec 16, 2020 · Updated 5 years ago
akirafukui / vqa-mcb
☆219 · Aug 13, 2016 · Updated 9 years ago
jnhwkim / ban-vqa
Bilinear attention networks for visual question answering
☆549 · Oct 30, 2023 · Updated 2 years ago
Cyanogenoid / vqa-counting
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆208 · Mar 5, 2019 · Updated 7 years ago
AishwaryaAgrawal / GVQA
Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …
☆27 · Mar 10, 2022 · Updated 4 years ago
ronghanghu / lcgn
Code release for Hu et al., "Language-Conditioned Graph Networks for Relational Reasoning", ICCV 2019
☆92 · Aug 9, 2019 · Updated 6 years ago
Cadene / murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆194 · Feb 9, 2020 · Updated 6 years ago
arijitray1993 / VQARelevance
Models and code for the paper "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions"
☆14 · Aug 6, 2018 · Updated 7 years ago
daqingliu / NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38 · Nov 23, 2019 · Updated 6 years ago
Shivanshu-Gupta / Visual-Question-Answering
CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering
☆78 · Jan 19, 2020 · Updated 6 years ago
peteanderson80 / bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,470 · Feb 3, 2023 · Updated 3 years ago
chenxinpeng / Optimization_of_image_description_metrics_using_policy_gradient_methods
TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods
☆29 · Jul 31, 2018 · Updated 7 years ago
gnouhp / PyTorch-AdaHAN
An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…
☆54 · Sep 1, 2018 · Updated 7 years ago
kevjshih / wtl_vqa
Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)
☆10 · Apr 8, 2020 · Updated 6 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300 · Jan 6, 2026 · Updated 6 months ago
lichengunc / speaker_listener_reinforcer
Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension
☆34 · Mar 8, 2018 · Updated 8 years ago
tzuhsial / pytorch-vqa-dan
A PyTorch implementation of Dual Attention Network
☆30 · Mar 27, 2022 · Updated 4 years ago
agakshat / visualdialog-pytorch
Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359
☆15 · May 16, 2019 · Updated 7 years ago
linjieli222 / VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187 · Apr 15, 2021 · Updated 5 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27 · May 6, 2021 · Updated 5 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150 · Mar 11, 2019 · Updated 7 years ago
markdtw / vqa-winner-cvprw-2017
PyTorch implementation of the winner of the VQA Challenge Workshop at CVPR'17
☆163 · Feb 8, 2019 · Updated 7 years ago
alibabadoufu / dynamic_fusion_reimplementation
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17 · Oct 30, 2019 · Updated 6 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14 · Jul 2, 2020 · Updated 6 years ago
gdlg / pytorch_compact_bilinear_pooling
Compact Bilinear Pooling for PyTorch
☆254 · Jul 6, 2022 · Updated 4 years ago
seqam-lab / DMVAE
☆18 · Feb 16, 2022 · Updated 4 years ago
pliang279 / factorized
[ICLR 2019] Learning Factorized Multimodal Representations
☆69 · Aug 4, 2020 · Updated 5 years ago
ZihaoWang-CV / CAMP_iccv19
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
☆126 · Feb 26, 2020 · Updated 6 years ago
lmelvix / visual-question-answering-tensorflow
Stacked attention network for answering open-ended questions about images
☆12 · May 31, 2018 · Updated 8 years ago
jokieleung / awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Common…
☆672 · Jul 6, 2023 · Updated 3 years ago
BierOne / relation-vqa
Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12 · Mar 13, 2026 · Updated 4 months ago
rshivansh / San-Pytorch
Let us try implementing SAN in PyTorch from scratch
☆16 · Jun 7, 2018 · Updated 8 years ago