Maitreyapatel / CRIPP-VQA
CRIPP-VQA Benchmark -- EMNLP, 2022
☆9 Updated 2 years ago
Alternatives and similar repositories for CRIPP-VQA:
Users who are interested in CRIPP-VQA are comparing it to the repositories listed below.
- ☆39 Updated 2 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering ☆39 Updated 4 years ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021). ☆37 Updated 10 months ago
- ☆16 Updated last month
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift ☆36 Updated last year
- Official Repository of NeurIPS2021 paper: PTR ☆33 Updated 3 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022 ☆30 Updated last year
- General-purpose Visual Understanding Evaluation ☆20 Updated last year
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images ☆13 Updated 2 years ago
- ☆68 Updated last year
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant ☆23 Updated last year
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… ☆37 Updated 10 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆33 Updated last year
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment ☆22 Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆18 Updated 2 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering ☆26 Updated 10 months ago
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning" ☆36 Updated last year
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆36 Updated 2 years ago
- Code for Debiasing Vision-Language Models via Biased Prompts ☆57 Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆43 Updated 2 years ago
- Counterfactual Reasoning VQA Dataset ☆25 Updated last year
- ☆29 Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆22 Updated last year
- A mini-framework for running AI2-Thor with Docker. ☆34 Updated last year
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling ☆36 Updated 3 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral) ☆39 Updated last year
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning" ☆43 Updated 3 years ago
- ☆26 Updated 3 years ago
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language ☆46 Updated 2 years ago
- Code, data, models for the Sherlock corpus ☆57 Updated 2 years ago