MayankSingal / VQA-Transformer
Visual Question Answering through transformers.
☆13Updated 6 years ago
Alternatives and similar repositories for VQA-Transformer
Users who are interested in VQA-Transformer are comparing it to the repositories listed below.
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆80Updated 4 years ago
- Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" (EMNLP 2018)☆36Updated 6 years ago
- Code of Dense Relational Captioning☆69Updated 2 years ago
- [EMNLP 2018] Training for Diversity in Image Paragraph Captioning☆89Updated 5 years ago
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆65Updated 4 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Updated 5 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- Chinese Visual Question Answering 中文看图问答☆47Updated 7 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 3 years ago
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Updated 4 years ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆164Updated 2 years ago
- Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attenti…☆21Updated 4 years ago
- This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Co…☆29Updated 5 years ago
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆50Updated 5 years ago
- Medical image caption ImageCLEF 2018☆29Updated 7 years ago
- ☆38Updated 2 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27Updated 4 years ago
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)☆9Updated 5 years ago
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…☆21Updated 4 years ago
- ☆44Updated 2 years ago
- A visualizer to display attention weights on text☆23Updated 6 years ago
- Domain Adaptation using External Knowledge for Sentiment Analysis☆54Updated last year
- TalkSumm - Scientific Paper Summarization Based on Conference Talks☆43Updated 3 years ago
- Codes of AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆26Updated 4 years ago
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆24Updated 2 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆38Updated 5 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 2 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding"☆16Updated 5 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Updated 2 years ago
- PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification"☆47Updated 5 years ago