PhoebusSi / VQA-VS
Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"
☆35 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for VQA-VS
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021) ☆22 · Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark. ☆41 · Updated 2 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment" ☆31 · Updated 2 years ago
- Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning" ☆12 · Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias ☆116 · Updated 2 years ago
- Source code for our EMNLP'21 paper "Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning" ☆56 · Updated 3 years ago
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022) ☆40 · Updated 2 years ago
- ACL'2023: Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning ☆41 · Updated 2 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding” ☆29 · Updated last year
- Source code and data for "Things not Written in Text: Exploring Spatial Commonsense from Visual Signals" (ACL2022 main conference paper). ☆20 · Updated 2 years ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning ☆29 · Updated 5 months ago
- my commonly-used tools ☆47 · Updated 3 months ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions ☆16 · Updated 5 months ago
- Official repository for the A-OKVQA dataset ☆64 · Updated 6 months ago
- Source code for multimodal dialogue systems with semantic elements (MATE, He et al., 2020). ☆8 · Updated 2 years ago
- Recent Advances in Visual Dialog ☆30 · Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev… ☆27 · Updated 10 months ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules" ☆38 · Updated 2 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering ☆88 · Updated last year