RaptorMai / MLLM-CompBenchLinks
[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes
☆40Updated 5 months ago
Alternatives and similar repositories for MLLM-CompBench
Users that are interested in MLLM-CompBench are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28Updated 2 years ago
- ☆67Updated 2 years ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images☆32Updated last year
- Test-Time Adaptation via Conjugate Pseudo-Labels☆42Updated 2 years ago
- Implementation for ECCV 2022 Paper "Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generali…☆20Updated 3 years ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆87Updated 10 months ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆57Updated 2 years ago
- [NeurIPS] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?☆70Updated 3 years ago
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆70Updated 5 months ago
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation☆37Updated last year
- ☆27Updated 2 years ago
- ☆22Updated last year
- ☆18Updated 2 years ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Updated last year
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆55Updated 6 months ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆38Updated last year
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated last year
- [CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zha…☆53Updated 2 years ago
- [NeurIPS 2023] Generalized Logit Adjustment☆38Updated last year
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆48Updated 2 years ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆98Updated last year
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))☆194Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆58Updated 2 years ago
- source code for ICLR'23 paper "Non-parametric Outlier Synthesis"☆52Updated last year
- ☆58Updated 8 months ago
- ☆39Updated 2 years ago
- Official implementation of "Towards Distribution-Agnostic Generalized Category Discovery" (NIPS 2023)☆25Updated last year
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"☆110Updated 2 years ago
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆167Updated 2 years ago
- Create generated datasets and train robust classifiers☆36Updated 2 years ago