RaptorMai / MLLM-CompBenchLinks

[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes

☆38

Alternatives and similar repositories for MLLM-CompBench

Users that are interested in MLLM-CompBench are comparing it to the libraries listed below

Sorting:

deeplearning-wisc / vit-spurious-robustness
☆26Updated 2 years ago
deeplearning-wisc / poem
PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022
☆28Updated 2 years ago
princetonvisualai / RememberThePast-DatasetDistillation
☆39Updated 2 years ago
locuslab / tta_conjugate
Test-Time Adaptation via Conjugate Pseudo-Labels
☆41Updated 2 years ago
yongchaoz / FRePo
Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)
☆47Updated 2 years ago
virajprabhu / LANCE
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
☆32Updated last year
thu-ml / HiDe-Prompt
Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)
☆86Updated 7 months ago
FarinaMatteo / zero
[NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!
☆47Updated 3 months ago
oripress / CCC
Code for Continuously Changing Corruptions (CCC) benchmark + evaluation
☆35Updated 10 months ago
vita-epfl / ttt-plus-plus
[NeurIPS21] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?
☆70Updated 3 years ago
rgeirhos / dataset-pruning-metrics
Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)
☆56Updated 2 years ago
PotatoTian / TPGM
This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning
☆24Updated last year
justincui03 / dc_benchmark
☆86Updated 2 years ago
SprocketLab / roboshot
☆22Updated last year
OSU-MLB / ViT_PEFT_Vision
[CVPR'25 (Highlight)] Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition
☆39Updated this week
deeplearning-wisc / dream-ood
source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"
☆68Updated 2 months ago
chingyaoc / debias_vl
Code for Debiasing Vision-Language Models via Biased Prompts
☆56Updated 2 years ago
deeplearning-wisc / scone
☆9Updated last year
JoyHuYY1412 / Class_Imbalanced_Semi_Supervised_Learning
☆18Updated 2 years ago
hee-suk-yoon / C-TPT
[ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"
☆16Updated last year
yossigandelsman / second_order_lens
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆39Updated 7 months ago
jaehong31 / APD
"Scalable and Order-robust Continual Learning with Additive Parameter Decomposition", ICLR 2020
☆23Updated 3 years ago
Saehyung-Lee / DCC
This repository is the official implementation of Dataset Condensation with Contrastive Signals (DCC), accepted at ICML 2022.
☆21Updated 3 years ago
zhangmarvin / memo
☆66Updated 2 years ago
cvlab-columbia / ZSRobust4FoundationModel
☆43Updated 2 years ago
ExplainableML / WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆57Updated last year
yu-rp / Distribution-Shift-Iverson
☆42Updated last year
PolinaKirichenko / deep_feature_reweighting
☆107Updated last year
princetonvisualai / multimodal_dataset_distillation
☆56Updated 5 months ago
LzVv123456 / Contrastive-Prototypical-Prompt
☆21Updated 3 months ago