Maitreyapatel / CRIPP-VQA
CRIPP-VQA Benchmark -- EMNLP, 2022
☆9Updated 2 years ago
Alternatives and similar repositories for CRIPP-VQA:
Users who are interested in CRIPP-VQA are comparing it to the repositories listed below
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆34Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆39Updated 10 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 23 Spotlight☆37Updated last year
- General-purpose Visual Understanding Evaluation☆20Updated last year
- Counterfactual Reasoning VQA Dataset☆24Updated last year
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆37Updated 7 months ago
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆13Updated 2 years ago
- GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.☆28Updated 3 years ago
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆39Updated 3 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆31Updated 11 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆27Updated 3 months ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Updated 7 months ago
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language☆45Updated last year
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆15Updated 9 months ago
- [TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.☆112Updated last year
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆29Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆24Updated 3 months ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆21Updated last year