Cyanogenoid/pytorch-vqa

Strong baseline for visual question answering

☆240

Alternatives and similar repositories for pytorch-vqa

Users who are interested in pytorch-vqa are comparing it to the libraries listed below.

Cadene / vqa.pytorch
Visual Question Answering in Pytorch
☆733 · Dec 11, 2019 · Updated 6 years ago
hengyuan-hu / bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768 · Mar 10, 2024 · Updated 2 years ago
Cyanogenoid / vqa-counting
[ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering
☆208 · Mar 5, 2019 · Updated 7 years ago
DenisDsh / VizWiz-VQA-PyTorch
PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind P…
☆64 · Oct 17, 2018 · Updated 7 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150 · Mar 11, 2019 · Updated 7 years ago
tbmoon / basic_vqa
Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)
☆98 · Aug 27, 2023 · Updated 2 years ago
markdtw / vqa-winner-cvprw-2017
PyTorch implementation of the winner of the VQA Challenge Workshop at CVPR'17
☆163 · Feb 8, 2019 · Updated 7 years ago
hexiang-hu / answer_embedding
Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)
☆13 · Apr 6, 2019 · Updated 7 years ago
jnhwkim / ban-vqa
Bilinear attention networks for visual question answering
☆549 · Oct 30, 2023 · Updated 2 years ago
Cadene / block.bootstrap.pytorch
BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models
☆354 · Dec 4, 2019 · Updated 6 years ago
Shivanshu-Gupta / Visual-Question-Answering
CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering
☆78 · Jan 19, 2020 · Updated 6 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300 · Jan 6, 2026 · Updated 6 months ago
peteanderson80 / bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,470 · Feb 3, 2023 · Updated 3 years ago
jokieleung / awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Common…
☆672 · Jul 6, 2023 · Updated 3 years ago
jiasenlu / HieCoAttenVQA
☆351 · Oct 2, 2018 · Updated 7 years ago
GT-Vision-Lab / VQA
☆392 · Mar 11, 2021 · Updated 5 years ago
SinghJasdeep / Attention-on-Attention-for-VQA
Visual Question Answering project with state-of-the-art single-model performance.
☆130 · Jun 18, 2018 · Updated 8 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38 · Nov 22, 2022 · Updated 3 years ago
chingyaoc / awesome-vqa
Visual Q&A reading list
☆439 · Oct 7, 2018 · Updated 7 years ago
abhshkdz / neural-vqa-attention
Attention-based Visual Question Answering in Torch
☆101 · Aug 13, 2017 · Updated 8 years ago
cvlab-tohoku / Dense-CoAttention-Network
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
☆107 · Oct 14, 2019 · Updated 6 years ago
yuzcccc / vqa-mfb
☆184 · Jul 30, 2019 · Updated 6 years ago
GT-Vision-Lab / VQA_LSTM_CNN
Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…
☆386 · Mar 22, 2019 · Updated 7 years ago
VedantYadav / VQA
VQA - Visual Question Answering
☆14 · Nov 13, 2016 · Updated 9 years ago
rshivansh / San-Pytorch
Let us try implementing SAN in pytorch from scratch
☆16 · Jun 7, 2018 · Updated 8 years ago
yikuan8 / Transformers-VQA
An implementation that applies pre-trained V+L models to downstream VQA tasks. Now supports: VisualBERT, LXMERT, and UNITER
☆165 · Dec 11, 2022 · Updated 3 years ago
MILVLG / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆459 · Dec 16, 2020 · Updated 5 years ago
HarshTrivedi / nmn-pytorch
Neural Module Network for VQA in Pytorch
☆107 · Dec 16, 2017 · Updated 8 years ago
noagarcia / awesome-vqa-pytorch
List of PyTorch repositories for visual question answering
☆15 · Jul 4, 2019 · Updated 7 years ago
ronghanghu / snmn
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks, in ECCV 2018
☆71 · Nov 17, 2019 · Updated 6 years ago
Cadene / murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆194 · Feb 9, 2020 · Updated 6 years ago
erobic / ramen
This is a pytorch implementation of our Recurrent Aggregation of Multimodal Embeddings Network (RAMEN) from our CVPR-2019 paper.
☆17 · Apr 5, 2020 · Updated 6 years ago
gabegrand / adversarial-vqa
☆12 · Aug 14, 2019 · Updated 6 years ago
linjieli222 / VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187 · Apr 15, 2021 · Updated 5 years ago
varunagrawal / VisualQA
Visual Question Answering in PyTorch
☆10 · Oct 22, 2025 · Updated 9 months ago
rosinality / mac-network-pytorch
Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch
☆85 · Feb 5, 2019 · Updated 7 years ago
facebookresearch / grid-feats-vqa
Grid features pre-training code for visual question answering
☆269 · Sep 17, 2021 · Updated 4 years ago
zcyang / imageqa-san
code for Stacked attention networks for image question answering
☆108 · Jan 7, 2017 · Updated 9 years ago
ntusteeian / VQA_CNN-LSTM
Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…
☆23 · Jul 30, 2020 · Updated 5 years ago