RaptorMai / MLLM-CompBench
[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes.
☆38 Updated 2 months ago
Alternatives and similar repositories for MLLM-CompBench
Users who are interested in MLLM-CompBench are comparing it to the repositories listed below.
- ☆26 Updated 2 years ago
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022 ☆28 Updated 2 years ago
- ☆39 Updated 2 years ago
- Test-Time Adaptation via Conjugate Pseudo-Labels ☆41 Updated 2 years ago
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022) ☆47 Updated 2 years ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images ☆32 Updated last year
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight) ☆86 Updated 7 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!! ☆47 Updated 3 months ago
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation ☆35 Updated 10 months ago
- [NeurIPS21] TTT++: When Does Self-supervised Test-time Training Fail or Thrive? ☆70 Updated 3 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆56 Updated 2 years ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning ☆24 Updated last year
- ☆86 Updated 2 years ago
- ☆22 Updated last year
- [CVPR'25 (Highlight)] Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition ☆39 Updated this week
- Source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models" ☆68 Updated 2 months ago
- Code for Debiasing Vision-Language Models via Biased Prompts ☆56 Updated 2 years ago
- ☆9 Updated last year
- ☆18 Updated 2 years ago
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion" ☆16 Updated last year
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP" ☆39 Updated 7 months ago
- "Scalable and Order-robust Continual Learning with Additive Parameter Decomposition", ICLR 2020 ☆23 Updated 3 years ago
- This repository is the official implementation of Dataset Condensation with Contrastive Signals (DCC), accepted at ICML 2022. ☆21 Updated 3 years ago
- ☆66 Updated 2 years ago
- ☆43 Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts… ☆57 Updated last year
- ☆42 Updated last year
- ☆107 Updated last year
- ☆56 Updated 5 months ago
- ☆21 Updated 3 months ago