allenai / multimodalqa
☆116 Updated 2 years ago
Alternatives and similar repositories for multimodalqa:
Users who are interested in multimodalqa are comparing it to the repositories listed below.
- ☆47 Updated last month
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021). ☆28 Updated 3 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" ☆63 Updated 2 years ago
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations ☆105 Updated 2 years ago
- ☆34 Updated last year
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper). ☆19 Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La… ☆114 Updated 2 years ago
- Lexically constrained text generation with CBART. ☆48 Updated 2 years ago
- The code for lifelong few-shot language learning ☆55 Updated 2 years ago
- Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022. ☆17 Updated 2 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023. ☆56 Updated last year
- ☆44 Updated 9 months ago
- ☆44 Updated 2 years ago
- Multitask Multilingual Multimodal Pre-training ☆71 Updated 2 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 Updated 2 years ago
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation ☆30 Updated 3 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021) ☆53 Updated 3 years ago
- ☆60 Updated 2 years ago
- Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020) ☆71 Updated last year
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆100 Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev… ☆31 Updated last month
- ☆28 Updated 11 months ago
- Source code for "Transforming Question Answering Datasets Into Natural Language Inference Datasets" ☆60 Updated 5 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs ☆17 Updated 4 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022) ☆41 Updated 2 years ago
- ☆31 Updated last year
- ☆101 Updated 2 years ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data" ☆42 Updated 3 months ago
- ☆69 Updated 10 months ago
- ☆88 Updated 2 years ago