RaptorMai / CompBench
CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scen…
☆31Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for CompBench
- Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024☆14Updated last month
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation☆29Updated 3 months ago
- Test-Time Adaptation via Conjugate Pseudo-Labels☆38Updated last year
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Updated 11 months ago
- Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition☆25Updated 3 weeks ago
- ☆37Updated 2 years ago
- ☆41Updated last year
- [NeurIPS21] TTT++: When Does Self-supervised Test-time Training Fail or Thrive?☆59Updated 2 years ago
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28Updated last year
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images☆28Updated 11 months ago
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆49Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆55Updated last year
- PyTorch implementation of various distillation approaches for continual learning of Diffusion Models.☆18Updated 7 months ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆76Updated last week
- [NeurIPS 2023 Spotlight] Combating Representation Learning Disparity with Geometric Harmonization☆20Updated 9 months ago
- Respect to the input tensor instead of paramters of NN☆15Updated 2 years ago
- Create generated datasets and train robust classifiers☆35Updated last year
- ☆24Updated last year
- Code and data for the paper "In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation"☆24Updated last year
- "Scalable and Order-robust Continual Learning with Additive Parameter Decomposition", ICLR 2020☆22Updated 2 years ago
- Official Code for Dataset Distillation using Neural Feature Regression (NeurIPS 2022)☆46Updated 2 years ago
- This codebase is the official implementation of Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization (NeurIPS2…☆93Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆53Updated last year
- Official repository of "Back to Source: Diffusion-Driven Test-Time Adaptation"☆71Updated 11 months ago
- Official implementation for 'Class-Balancing Diffusion Models'☆46Updated 6 months ago
- ☆60Updated last year
- Augmenting with Language-guided Image Augmentation (ALIA)☆63Updated last year
- [ICLR 2024 Spotlight] Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization☆26Updated last month
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆21Updated 2 weeks ago
- ☆16Updated 2 years ago