allenai / dqa-net
Diagram question answering system described in "A Diagram is Worth a Dozen Images"
☆38Updated 7 years ago
Alternatives and similar repositories for dqa-net:
Users that are interested in dqa-net are comparing it to the libraries listed below
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- GuessWhat?! Baselines☆73Updated 2 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Updated 3 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Updated 3 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Updated 4 years ago
- Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Ar…☆26Updated 4 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆31Updated 5 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Updated 6 years ago
- Official Github repo of the VIST Challenge NAACL 2018☆17Updated 6 years ago
- vist story telling evaluation tool☆21Updated last year
- ☆24Updated 5 years ago
- Pre-trained V+L Data Preparation☆45Updated 4 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Updated 5 years ago
- Visual Verb Sense Disambiguation☆13Updated 5 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆20Updated 8 years ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆94Updated 2 years ago
- Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017☆67Updated 6 years ago
- Visual Storytelling API☆35Updated 8 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Updated 5 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆79Updated 2 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 6 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- The implementation of Text-guided Attention Model for Image Captioning☆21Updated 7 years ago
- Support, annotation, evaluation, and baseline models for the imSitu dataset.☆58Updated 4 years ago
- Scene Graph Parsing as Dependency Parsing☆41Updated 5 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Updated last year
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆14Updated 5 years ago
- Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"☆13Updated 4 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.☆57Updated 5 years ago
- tensorflow Implementation of https://github.com/facebookresearch/MIXER☆11Updated 7 years ago