☆12Mar 8, 2021Updated 4 years ago
Alternatives and similar repositories for catt
Users that are interested in catt are comparing it to the libraries listed below
Sorting:
- ☆79Oct 8, 2022Updated 3 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering☆29Jul 1, 2024Updated last year
- my Ph.D. thesis (Zhejiang University)☆38Apr 9, 2022Updated 3 years ago
- https://arxiv.org/abs/2106.12442☆10Jun 22, 2021Updated 4 years ago
- The official PyTorch Implementation of the Paper "Adversarial Visual Robustness by Causal Intervention"☆18Oct 6, 2021Updated 4 years ago
- BottomUpTopDown VQA model with question-type debiasing☆22Oct 6, 2019Updated 6 years ago
- ☆10Mar 21, 2022Updated 3 years ago
- EMNLP'2020: Look at the First Sentence: Position Bias in Question Answering☆29Nov 4, 2020Updated 5 years ago
- [ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition☆80Dec 1, 2023Updated 2 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- ☆17Sep 2, 2023Updated 2 years ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆130Dec 15, 2021Updated 4 years ago
- Official PyTorch Implementation for Testing of TransZero++(TPAMI'22)☆10Aug 25, 2023Updated 2 years ago
- ☆64Jan 5, 2022Updated 4 years ago
- the implementation of EMNLP 2020 "Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering"☆15Sep 9, 2021Updated 4 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- ☆34Jul 28, 2021Updated 4 years ago
- NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering☆65Mar 29, 2021Updated 4 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- vqa drived by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Apr 24, 2020Updated 5 years ago
- A collections of papers about VQA-CP datasets and their results☆41Mar 18, 2022Updated 3 years ago
- ☆17Mar 13, 2023Updated 2 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- ☆20Oct 21, 2022Updated 3 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 6 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- ☆20Sep 28, 2020Updated 5 years ago
- ☆23Aug 28, 2023Updated 2 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 2 years ago
- Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…☆46Jul 27, 2019Updated 6 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆69Oct 11, 2021Updated 4 years ago
- ☆26Apr 15, 2021Updated 4 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- ☆24May 22, 2023Updated 2 years ago