PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind People
☆64Oct 17, 2018Updated 7 years ago
Alternatives and similar repositories for VizWiz-VQA-PyTorch
Users that are interested in VizWiz-VQA-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30Mar 24, 2018Updated 8 years ago
- Strong baseline for visual question answering☆241Mar 13, 2023Updated 3 years ago
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)☆13Apr 6, 2019Updated 7 years ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago
- Let us try implementing SAN in pytorch from scratch☆16Jun 7, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Visual Question Answering through modal dialogue + API☆15Dec 8, 2022Updated 3 years ago
- Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)☆98Aug 27, 2023Updated 2 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 8 years ago
- This is a tensorflow implementation of 2018 NIPS paper: [GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations.…☆14Dec 2, 2018Updated 7 years ago
- Attention-based Visual Question Answering in Torch☆101Aug 13, 2017Updated 8 years ago
- Visual Q&A reading list☆440Oct 7, 2018Updated 7 years ago
- This is a code repository of Graphhopper: Multi-Hop Scene GraphReasoning for Visual Question Answering☆19Oct 30, 2021Updated 4 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Mar 11, 2019Updated 7 years ago
- Stacked attention network for answering open-ended questions about image☆12May 31, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…☆298Jan 6, 2026Updated 3 months ago
- This project is out of date, I don't remember the details inside...☆84Dec 2, 2017Updated 8 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network☆11Jun 28, 2021Updated 4 years ago
- Repository for image caption for Chinese☆28Dec 3, 2017Updated 8 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…☆386Mar 22, 2019Updated 7 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆355Dec 4, 2019Updated 6 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Visual Question Answering Demo on pretrained model☆248Oct 31, 2025Updated 5 months ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆78Jan 19, 2020Updated 6 years ago
- A PyTorch implementation of Dual Attention Network☆30Mar 27, 2022Updated 4 years ago
- This is a PyTorch implementation of the Unsupervised Domain Adaptation method proposed in the paper Deep CORAL: Correlation Alignment for…☆59Oct 12, 2018Updated 7 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆27Mar 10, 2022Updated 4 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆207Mar 5, 2019Updated 7 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Jul 13, 2018Updated 7 years ago
- code for Stacked attention networks for image question answering☆108Jan 7, 2017Updated 9 years ago
- Bilinear attention networks for visual question answering☆548Oct 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Project for Dynamic Capsule Attention☆12Dec 7, 2019Updated 6 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- code for fluency-guided cross-lingual image captioning☆33Apr 13, 2018Updated 8 years ago
- Visual Question Answering task written in Keras that answers questions about images☆156May 10, 2019Updated 6 years ago
- PyTorch bottom-up attention with Detectron2☆240Jan 4, 2022Updated 4 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Feb 11, 2020Updated 6 years ago