RaptorMai / MLLM-CompBenchLinks
[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes
☆41Updated 9 months ago
Alternatives and similar repositories for MLLM-CompBench
Users that are interested in MLLM-CompBench are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28Updated 2 years ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images☆32Updated 2 years ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆60Updated 2 years ago
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated 2 years ago
- Test-Time Adaptation via Conjugate Pseudo-Labels☆42Updated 2 years ago
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆48Updated 3 years ago
- ☆72Updated 2 years ago