Jielin-Qiu / MM_RobustnessLinks

[DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift

☆38

Alternatives and similar repositories for MM_Robustness

Users that are interested in MM_Robustness are comparing it to the libraries listed below

Sorting:

eric-ai-lab / CPL
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆34Updated 3 years ago
ajd12342 / why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31Updated 2 years ago
chingyaoc / debias_vl
Code for Debiasing Vision-Language Models via Biased Prompts
☆60Updated 2 years ago
YiyangZhou / POVID
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
☆91Updated last year
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆34Updated 2 years ago
limanling / KnowledgeVL-Reading
☆67Updated 2 years ago
yiren-jian / BLIText
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆26Updated 2 years ago
bcdnlp / FAITHSCORE
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models
☆32Updated last month
MikeWangWZHL / Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆37Updated 2 years ago
sIncerass / MVLPT
code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
☆56Updated last year
Yuqifan1117 / HalluciDoctor
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)
☆51Updated last year
lancopku / clip-openness
[ACL 2023] Delving into the Openness of CLIP
☆23Updated 2 years ago
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆31Updated 3 years ago
cvlab-columbia / DoubleRight
☆27Updated last year
YiyangZhou / LURE
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
☆155Updated last year
GLAMOR-USC / CLiMB
The Continual Learning in Multimodality Benchmark
☆68Updated 2 years ago
showlab / CLVQA
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆40Updated last year
Letian2003 / C-VQA
Counterfactual Reasoning VQA Dataset
☆27Updated 2 years ago
LisaAnne / Hallucination
☆88Updated 6 years ago
rabiulcste / vqazero
visual question answering prompting recipes for large vision-language models
☆28Updated last year
haoyiq114 / VALOR
Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)
☆16Updated last year
yfzhang114 / LLaVA-Align
[ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…
☆82Updated 10 months ago
ExplainableML / WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆61Updated 2 years ago
changdaeoh / multimodal-mixup
Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"
☆36Updated last year
deeplearning-wisc / vit-spurious-robustness
☆27Updated 2 years ago
iCGY96 / awesome_concept_learning_list
A curated list of papers & resources linked to concept learning
☆13Updated 2 years ago
allenai / aokvqa
Official repository for the A-OKVQA dataset
☆106Updated last year
SivanDoveh / TSVLC
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Updated 2 years ago
gzcch / Bingo
☆55Updated last year
yossigandelsman / second_order_lens
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆42Updated last year