kushalkafle / DVQA_dataset
DVQA Dataset: a bar chart question answering dataset presented at CVPR 2018
☆37Updated 6 years ago
Alternatives and similar repositories for DVQA_dataset
Users who are interested in DVQA_dataset are comparing it to the libraries listed below.
- ☆26Updated 5 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆56Updated 5 months ago
- Code, data, models for the Sherlock corpus☆58Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Updated 3 years ago
- ☆136Updated 2 years ago
- ☆45Updated 3 months ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"☆190Updated 4 years ago
- Multimodal Graph Network (MGN): Code repo, examples from the paper☆25Updated 4 years ago
- ☆117Updated last year
- Dataset and starting code for visual entailment dataset☆112Updated 3 years ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Updated 2 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 3 years ago
- ☆25Updated 3 years ago
- Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO☆52Updated 5 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆115Updated 3 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆79Updated 2 years ago
- Visual Storytelling post-edit dataset☆18Updated 5 years ago
- Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)☆68Updated 4 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆88Updated 2 years ago
- SciCap Dataset☆56Updated 3 years ago
- ☆40Updated 2 years ago
- [EMNLP 2021] The baseline code for WebSRC dataset.☆50Updated 5 months ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Updated 4 years ago
- Pre-trained V+L Data Preparation☆46Updated 5 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆68Updated 3 years ago
- ☆16Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆97Updated 5 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆224Updated 3 years ago
- ☆17Updated 6 months ago