MayankSingal / VQA-Transformer
Visual Question Answering through transformers.
☆13Updated 6 years ago
Alternatives and similar repositories for VQA-Transformer:
Users that are interested in VQA-Transformer are comparing it to the libraries listed below
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 6 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆80Updated 4 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Updated 4 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆89Updated 5 years ago
- Chinese Visual Question Answering 中文看图问答☆47Updated 7 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆38Updated 5 years ago
- Neural Machine Translation with universal Visual Representation (ICLR 2020)☆88Updated 4 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆102Updated 6 years ago
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…☆119Updated 4 years ago
- image captioning paper list☆8Updated 5 years ago
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)☆9Updated 5 years ago
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Updated 4 years ago
- Code for paper "Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling"☆7Updated 5 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆26Updated 3 years ago
- ☆33Updated 4 years ago
- Code of Dense Relational Captioning☆69Updated 2 years ago
- ☆44Updated 2 years ago
- ☆28Updated 5 years ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆163Updated 2 years ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆58Updated 7 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 2 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Updated 5 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆38Updated last year
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆24Updated 2 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Updated 3 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding"☆16Updated 5 years ago
- vist story telling evaluation tool☆21Updated last year
- ☆63Updated 3 years ago
- A collection of models for image<->text generation in ACM MM 2021.☆66Updated 3 years ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144☆58Updated last year