MayankSingal / VQA-Transformer
Visual Question Answering through transformers.
☆13 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for VQA-Transformer
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020 ☆81 · Updated 4 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018) ☆37 · Updated 6 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019) ☆64 · Updated 4 years ago
- Chinese Visual Question Answering 中文看图问答 ☆47 · Updated 7 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019) ☆34 · Updated 4 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning ☆90 · Updated 5 years ago
- Code for Dense Relational Captioning ☆67 · Updated last year
- Visual Question Generation reading list ☆27 · Updated 4 years ago
- A self-evident application of the VQA task is to design systems that aid blind people with sight-reliant queries. The VizWiz VQA dataset … ☆14 · Updated 11 months ago
- VIST storytelling evaluation tool ☆21 · Updated 11 months ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019) ☆34 · Updated 5 years ago
- ☆62 · Updated 2 years ago
- Re-implementation of "R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering". ☆12 · Updated 5 years ago
- ☆39 · Updated last year
- N2NMN implemented in PyTorch ☆19 · Updated 5 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space ☆58 · Updated 6 years ago
- TensorFlow implementation of C. Gan, Z. Gan, X. He, J. Gao, and L. Deng, “StyleNet: Generating Attractive Visual Captions with Styles” ☆9 · Updated 6 years ago
- PyTorch implementation of Chinese image captioning on the AI_challenger dataset ☆34 · Updated 4 years ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144 ☆57 · Updated 11 months ago
- Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles" ☆34 · Updated 3 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning ☆91 · Updated 7 months ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog" ☆65 · Updated last year
- Starter code for the VMT task and challenge ☆51 · Updated 4 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs". ☆37 · Updated last year
- Code repository for Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering ☆18 · Updated 3 years ago
- A GCN-based visual question generation model ☆13 · Updated 5 years ago
- Code for the paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019 ☆49 · Updated 4 years ago
- The paper "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning", accepted at the International Joint Conference on Arti… ☆18 · Updated 7 years ago