allenai / multimodalqaLinks
☆129Updated 2 years ago
Alternatives and similar repositories for multimodalqa
Users that are interested in multimodalqa are comparing it to the libraries listed below
Sorting:
- ☆52Updated 5 months ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆64Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆36Updated 6 months ago
- ☆18Updated last year
- ☆45Updated last year
- ☆35Updated 2 years ago
- ☆32Updated last year
- Lexically constrained text generation with CBART.☆49Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆55Updated 2 months ago
- Repository for VisualSem: a high-quality knowledge graph to support research in vision and language.☆88Updated 2 years ago
- ☆104Updated 3 years ago
- The code for lifelong few-shot language learning☆55Updated 3 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)☆42Updated 3 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆72Updated 3 years ago
- ☆62Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Updated 3 years ago
- Source code for "Transforming Question Answering Datasets Into Natural Language Inference Datasets"☆62Updated 6 years ago
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations☆106Updated 2 years ago
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation☆31Updated 3 years ago
- ☆32Updated last year
- Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".☆17Updated 2 years ago
- Multitask Multilingual Multimodal Pre-training☆72Updated 2 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆28Updated 3 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)☆73Updated last year
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- ☆38Updated last year
- Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.☆17Updated 2 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Updated 5 years ago