Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"
☆22Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for rex
Users that are interested in rex are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆69Oct 11, 2021Updated 4 years ago
- Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…☆12Jun 27, 2025Updated 8 months ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆48Nov 3, 2022Updated 3 years ago
- GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.☆32Mar 1, 2021Updated 5 years ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆27Oct 13, 2022Updated 3 years ago
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆33Sep 29, 2021Updated 4 years ago
- A Survey on Interpretable Cross-modal Reasoning☆15Oct 12, 2023Updated 2 years ago
- ☆11Feb 1, 2023Updated 3 years ago
- This is the official implementation of the Video Dialog as Conversation about Objects Living in Space-Time paper☆32Jul 11, 2022Updated 3 years ago
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)☆13Apr 6, 2019Updated 6 years ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆66Dec 2, 2022Updated 3 years ago
- ☆18Dec 8, 2022Updated 3 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- [ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…☆20Jul 21, 2022Updated 3 years ago
- ☆12Jun 17, 2020Updated 5 years ago
- ☆18May 31, 2023Updated 2 years ago
- ☆19Dec 25, 2021Updated 4 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆50Jan 30, 2024Updated 2 years ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆20May 10, 2022Updated 3 years ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆44Aug 27, 2022Updated 3 years ago
- ☆40Nov 29, 2022Updated 3 years ago
- Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …☆24Oct 19, 2022Updated 3 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆46Jun 5, 2023Updated 2 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)☆87Apr 10, 2022Updated 3 years ago
- Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"☆50Jun 29, 2021Updated 4 years ago
- [IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)☆21Aug 31, 2022Updated 3 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated 2 years ago
- Code used in ACL rebuttal☆31Sep 3, 2024Updated last year