Maitreyapatel / CRIPP-VQA

CRIPP-VQA Benchmark -- EMNLP, 2022

☆9

Alternatives and similar repositories for CRIPP-VQA

Users who are interested in CRIPP-VQA are comparing it to the repositories listed below

microsoft / DFOL-VQA
Differentiable First-Order Logic Reasoning for Visual Question Answering
☆40Updated 4 years ago
zfchenUnique / DCL-Release
This repo contains the PyTorch implementation of Dynamic Concept Learner (accepted at ICLR 2021).
☆37Updated last year
Jielin-Qiu / MM_Robustness
[DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift
☆37Updated last year
Lizw14 / CaliCO
Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
☆14Updated 2 years ago
zfchenUnique / compositional_physics_learner
☆39Updated 3 years ago
limanling / KnowledgeVL-Reading
☆68Updated 2 years ago
WellyZhang / PrAE
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
☆26Updated 4 years ago
UMass-Embodied-AGI / genome
☆16Updated 4 months ago
ajd12342 / why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆30Updated 2 years ago
GLAMOR-USC / CLiMB
The Continual Learning in Multimodality Benchmark
☆67Updated 2 years ago
cambridgeltl / visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.
☆128Updated 2 years ago
dingmyu / VRDP
[NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
☆46Updated 2 years ago
WellyZhang / ACRE
ACRE: Abstract Causal REasoning Beyond Covariation
☆19Updated 3 years ago
WellyZhang / RAVEN
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
☆176Updated 3 months ago
wildphoton / Compositional-Generalization
[NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…
☆13Updated 2 years ago
serre-lab / CVR
A Benchmark for Efficient and Compositional Visual Reasoning
☆25Updated 2 years ago
chuangg / CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
☆121Updated 4 years ago
JindongJiang / GNM
Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines"
☆35Updated last year
sjtuytc / Neurips21-ProTo-Program-guided-Transformers-for-Program-guided-Tasks
Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks"
☆21Updated 3 years ago
shizhediao / DaVinci
Source code for the paper "Prefix Language Models are Unified Modal Learners"
☆43Updated 2 years ago
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆33Updated 2 years ago
belindal / LaMPP
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37Updated 2 years ago
hughplay / TVR
Transformation Driven Visual Reasoning - CVPR 2021
☆37Updated 2 years ago
LisaAnne / Hallucination
☆75Updated 6 years ago
StanLei52 / TQVSR
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆23Updated last year
alexpashevich / E.T.
Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…
☆91Updated 2 years ago
evelinehong / PTR
Official Repository of NeurIPS2021 paper: PTR
☆33Updated 3 years ago
easonnie / mlp-vil
MLPs for Vision and Language Modeling (Coming Soon)
☆27Updated 3 years ago
wenhuchen / Meta-Module-Network
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43Updated 4 years ago
evelinehong / VLGrammar
☆29Updated 3 years ago