DenisDsh / VizWiz-VQA-PyTorchView external linksLinks
PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind People
☆63Oct 17, 2018Updated 7 years ago
Alternatives and similar repositories for VizWiz-VQA-PyTorch
Users that are interested in VizWiz-VQA-PyTorch are comparing it to the libraries listed below
Sorting:
- ☆29Mar 24, 2018Updated 7 years ago
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
- Strong baseline for visual question answering☆240Mar 13, 2023Updated 2 years ago
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)☆13Apr 6, 2019Updated 6 years ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago
- Released code for the paper: Where To Look: Focus Regions for Visual Question Answering. (CVPR2016)☆10Apr 8, 2020Updated 5 years ago
- Stacked attention network for answering open-ended questions about image☆12May 31, 2018Updated 7 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 7 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆27Mar 10, 2022Updated 3 years ago
- Visual Question Answering through modal dialogue + API☆15Dec 8, 2022Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆765Mar 10, 2024Updated last year
- Let us try implementing SAN in pytorch from scratch☆16Jun 7, 2018Updated 7 years ago
- This project is out of date, I don't remember the details inside...☆84Dec 2, 2017Updated 8 years ago
- This is a tensorflow implementation of 2018 NIPS paper: [GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations.…☆14Dec 2, 2018Updated 7 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network☆11Jun 28, 2021Updated 4 years ago
- A PyTorch implementation of Dual Attention Network☆30Mar 27, 2022Updated 3 years ago
- Repository for image caption for Chinese☆28Dec 3, 2017Updated 8 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…☆297Jan 6, 2026Updated last month
- Visual Q&A reading list☆440Oct 7, 2018Updated 7 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Mar 11, 2019Updated 6 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆356Dec 4, 2019Updated 6 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆77Jan 19, 2020Updated 6 years ago
- PyTorch bottom-up attention with Detectron2☆239Jan 4, 2022Updated 4 years ago
- Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge.☆20Jun 28, 2018Updated 7 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 7 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Jul 13, 2018Updated 7 years ago
- ☆183Jul 30, 2019Updated 6 years ago
- ☆25Oct 31, 2022Updated 3 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- Chinese Visual Question Answering 中文看图问答☆47Sep 16, 2017Updated 8 years ago
- [ICLR 2018] Learning to Count Objects in Natural Images for Visual Question Answering☆207Mar 5, 2019Updated 6 years ago
- Attention-based Visual Question Answering in Torch☆101Aug 13, 2017Updated 8 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆163Feb 8, 2019Updated 7 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,465Feb 3, 2023Updated 3 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆423Jan 18, 2022Updated 4 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 7 years ago
- Hierarchical Question-Image Co-Attention for Visual Question Answering☆24Jun 2, 2019Updated 6 years ago