[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13 · Nov 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for KDDAug
Users that are interested in KDDAug are comparing it to the libraries listed below
- ☆34 · Jun 27, 2022 · Updated 3 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. Also for the ACM MM Visual Relation Understanding Grand Challenge ☆39 · Dec 5, 2022 · Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog" ☆13 · May 12, 2023 · Updated 2 years ago
- ☆27 · Oct 7, 2021 · Updated 4 years ago
- Counterfactual Samples Synthesizing for Robust VQA ☆79 · Nov 24, 2022 · Updated 3 years ago
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension" ☆14 · Dec 3, 2024 · Updated last year
- ☆13 · Feb 14, 2022 · Updated 4 years ago
- [ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing. ☆20 · May 6, 2025 · Updated 10 months ago
- ☆18 · May 31, 2023 · Updated 2 years ago
- Using image captions with LLM for zero-shot VQA ☆18 · Mar 14, 2024 · Updated last year
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆48 · Jul 11, 2023 · Updated 2 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering ☆29 · Jul 1, 2024 · Updated last year
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge" ☆21 · May 8, 2023 · Updated 2 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding" ☆22 · Dec 20, 2020 · Updated 5 years ago
- ☆25 · Apr 16, 2022 · Updated 3 years ago
- Generative Bias for Robust Visual Question Answering (CVPR 2023) ☆28 · Jul 4, 2023 · Updated 2 years ago
- The official implementation of two AI-enhanced numerical solvers: NeurVec (Sci. Rep.) and AttNS (ICML'24) ☆27 · May 21, 2024 · Updated last year
- ☆30 · Dec 16, 2022 · Updated 3 years ago
- Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks ☆30 · May 25, 2022 · Updated 3 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context" ☆30 · Jul 4, 2018 · Updated 7 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias ☆130 · Dec 15, 2021 · Updated 4 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering ☆31 · Apr 30, 2024 · Updated last year
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment" ☆31 · Nov 24, 2021 · Updated 4 years ago
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation ☆32 · Oct 19, 2023 · Updated 2 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference" ☆11 · Sep 3, 2024 · Updated last year
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles. ☆36 · Mar 11, 2022 · Updated 3 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution" ☆10 · Nov 1, 2022 · Updated 3 years ago
- Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries ☆10 · Dec 28, 2018 · Updated 7 years ago
- Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection ☆43 · Jun 4, 2024 · Updated last year
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) ☆34 · Jul 2, 2020 · Updated 5 years ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23) ☆44 · Mar 28, 2024 · Updated last year
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization". ☆10 · May 30, 2019 · Updated 6 years ago
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens… ☆17 · Jan 24, 2025 · Updated last year
- ☆10 · May 4, 2018 · Updated 7 years ago
- ☆12 · Dec 20, 2024 · Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models" ☆12 · Mar 19, 2024 · Updated last year
- Library for automatic time series forecasting based on ARIMA models ☆12 · May 14, 2017 · Updated 8 years ago
- ☆10 · Oct 6, 2021 · Updated 4 years ago
- The good practices in the VQA system, such as pos-tag attention, structured triplet learning and triplet attention, are very general and can be… ☆19 · Jan 23, 2018 · Updated 8 years ago