bupt-cist / vqa-playground-pytorch
Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"
☆28Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for vqa-playground-pytorch
Users who are interested in vqa-playground-pytorch are comparing it to the repositories listed below:
- GuessWhat?! is a challenging task-oriented visual dialogue problem. Tensorflow code for the papers, <Visual Dialogue State Tracking f…☆11May 16, 2024Updated last year
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 2 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Mar 11, 2019Updated 6 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- Recompiles the library with caffe2 in PyTorch stable (1.0) and re-implements the AICamera example officially provided by caffe2.☆35Jan 8, 2019Updated 7 years ago
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆15May 16, 2019Updated 6 years ago
- A prototype for distributed training/validation/evaluation/extraction with PyTorch.☆14Jun 13, 2020Updated 5 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Feb 1, 2021Updated 5 years ago
- A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…☆297Jan 6, 2026Updated last month
- NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering☆65Mar 29, 2021Updated 4 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Sep 19, 2019Updated 6 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- Other than papers from big-name labs and universities, most AI research papers get less than 10 readers, even though there might be gems …☆15Jul 20, 2018Updated 7 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning☆169Oct 10, 2018Updated 7 years ago
- ☆19Feb 6, 2019Updated 7 years ago
- A VideoQA dataset based on the videos from ActivityNet☆91Nov 22, 2020Updated 5 years ago
- ☆15Aug 13, 2020Updated 5 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"☆41Sep 9, 2019Updated 6 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding☆23Jun 27, 2018Updated 7 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- The source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆23Mar 10, 2019Updated 6 years ago
- A Better Way to Attend: Attention with Trees for Video Question Answering☆25Mar 25, 2019Updated 6 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Nov 4, 2020Updated 5 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Dec 8, 2022Updated 3 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆58Oct 25, 2021Updated 4 years ago
- Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)☆58Nov 6, 2019Updated 6 years ago
- Implementation of the Budgeted Super Networks☆25Feb 25, 2019Updated 6 years ago
- Supporting code for ReCEval paper☆31Sep 14, 2024Updated last year
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Repository for our CVPR 2017 and IJCV: TGIF-QA☆177Sep 6, 2021Updated 4 years ago
- ☆29Jun 23, 2018Updated 7 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Jul 4, 2018Updated 7 years ago