ItemZheng/KDDAug

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ItemZheng/KDDAug)

ItemZheng / KDDAug

[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering

☆13

Alternatives and similar repositories for KDDAug

Users that are interested in KDDAug are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
Dawn-LX / VidVRD-tracklets
View on GitHub
Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge
☆40Dec 5, 2022Updated 3 years ago
SRI-CSL / TrinityMultimodalTrojAI
View on GitHub
☆35Jun 27, 2022Updated 4 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zhongshsh / MoExtend
View on GitHub
ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"
☆15Dec 3, 2024Updated last year
val-iisc / RMLVQA
View on GitHub
☆19May 31, 2023Updated 3 years ago
Dawn-LX / VidSGG-BIG
View on GitHub
Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…
☆47Jul 11, 2023Updated 3 years ago
HKUST-LongGroup / CLIPDrag
View on GitHub
[ICLR 2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆20May 6, 2025Updated last year
aditya10 / VLC-BERT
View on GitHub
Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
tejas-gokhale / vqa_mutant
View on GitHub
☆13Feb 14, 2022Updated 4 years ago
ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆19Mar 14, 2024Updated 2 years ago
CGCL-codes / Gen-AF
View on GitHub
The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".
☆11Jun 28, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lancopku / CMAC
View on GitHub
The dataset and code for the paper "Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information"
☆19Oct 28, 2019Updated 6 years ago
thunlp / VisualDS
View on GitHub
☆24Apr 16, 2022Updated 4 years ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
princetonvisualai / SPICE-U
View on GitHub
☆11Sep 7, 2020Updated 5 years ago
PhoebusSi / Thinking-while-Observing
View on GitHub
Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"
☆12Jun 30, 2023Updated 3 years ago
cdancette / detect-shortcuts
View on GitHub
Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
☆29Jul 1, 2024Updated 2 years ago
yuleiniu / cfvqa
View on GitHub
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆136Dec 15, 2021Updated 4 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
yuleiniu / vc
View on GitHub
Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"
☆30Jul 4, 2018Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jialinwu17 / MAVEX
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
chojw / genb
View on GitHub
Generative Bias for Robust Visual Question Answering ( CVPR 2023 )
☆28Jul 4, 2023Updated 3 years ago
ImKeTT / ReSee
View on GitHub
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
☆12Dec 4, 2023Updated 2 years ago
Aman-4-Real / See-or-Guess
View on GitHub
[ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning
☆16Feb 17, 2025Updated last year
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
dedekinds / NeurVec
View on GitHub
The official implementation of two AI-enhanced numerical solvers: NeurVec (Sci. Rep.) and AttNS (ICML'24)
☆27May 21, 2024Updated 2 years ago
nyukat / greedy_multimodal_learning
View on GitHub
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
☆29May 25, 2022Updated 4 years ago
LivXue / VCIN
View on GitHub
Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…
☆13Apr 13, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
gicheonkang / sglkt-visdial
View on GitHub
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13Feb 1, 2023Updated 3 years ago
wakuwu / OSFD
View on GitHub
(AAAI 2024) Transferable Adversarial Attacks for Object Detection using Object-Aware Significant Feature Distortion
☆17Dec 13, 2023Updated 2 years ago
sunnychencool / AOQ
View on GitHub
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Jul 2, 2020Updated 6 years ago
MengyuanChen21 / Awesome-Visual-Dialog
View on GitHub
A curated publication list on visual dialog
☆14May 8, 2023Updated 3 years ago
Phantivia / T-PGD
View on GitHub
[Findings of ACL 2023] Bridge the Gap Between CV and NLP! A Optimization-based Textual Adversarial Attack Framework.
☆14Aug 27, 2023Updated 2 years ago
mad-red / VSR-guided-CIC
View on GitHub
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Mar 11, 2022Updated 4 years ago
zhangxi1997 / VQACL
View on GitHub
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
☆45Mar 28, 2024Updated 2 years ago