RaptorMai / CompBench
CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scen…
☆36Updated 5 months ago
Alternatives and similar repositories for CompBench:
Users that are interested in CompBench are comparing it to the libraries listed below
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation☆31Updated 4 months ago
- Test-Time Adaptation via Conjugate Pseudo-Labels☆39Updated last year
- ☆42Updated last year
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28Updated last year
- [NeurIPS21] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?☆62Updated 2 years ago
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆57Updated last year
- ☆24Updated last year
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Updated last year
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆35Updated 2 months ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆78Updated 2 months ago
- ☆15Updated 2 years ago
- ☆37Updated 2 years ago
- ☆48Updated last year
- ☆16Updated 2 years ago
- ☆50Updated 2 weeks ago
- [CVPR23] "Understanding and Improving Visual Prompting: A Label-Mapping Perspective" by Aochuan Chen, Yuguang Yao, Pin-Yu Chen, Yihua Zha…☆52Updated last year
- ☆27Updated last year
- ☆17Updated last year
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images☆28Updated last year
- [NeurIPS 2022] The official code for our NeurIPS 2022 paper "Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnab…☆44Updated 2 years ago
- Official implementation for 'Class-Balancing Diffusion Models'☆46Updated 8 months ago
- ☆60Updated last year
- This code accompanies the paper "Parameter-free Online Test-time Adaptation".☆67Updated 2 years ago
- PyTorch implementation of various distillation approaches for continual learning of Diffusion Models.☆19Updated 9 months ago
- This codebase is the official implementation of Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization (NeurIPS2…☆95Updated 3 years ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆47Updated 2 months ago
- [NeurIPS 2023] Generalized Logit Adjustment☆33Updated 8 months ago
- "Scalable and Order-robust Continual Learning with Additive Parameter Decomposition", ICLR 2020☆22Updated 2 years ago
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆49Updated last year
- CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis☆18Updated 6 months ago