Maitreyapatel / CRIPP-VQA
CRIPP-VQA Benchmark -- EMNLP, 2022
☆9Updated 2 years ago
Alternatives and similar repositories for CRIPP-VQA:
Users that are interested in CRIPP-VQA are comparing it to the libraries listed below
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆34Updated last year
- ☆38Updated 2 years ago
- ☆15Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Updated 6 months ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆39Updated 3 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆39Updated 10 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆38Updated 11 months ago
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆23Updated last year
- General-purpose Visual Understanding Evaluation☆20Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆32Updated last year
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- ☆67Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆30Updated last year
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020☆69Updated 10 months ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆65Updated 2 years ago
- Counterfactual Reasoning VQA Dataset☆24Updated last year
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆13Updated 2 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆47Updated this week
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Updated last year
- ☆40Updated last year
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37Updated last year
- Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling☆35Updated 2 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers"☆44Updated 3 years ago
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks☆20Updated 2 years ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆19Updated last year
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Updated last year