[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning
☆22Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for EvALign-ICL
Users that are interested in EvALign-ICL are comparing it to the libraries listed below
Sorting:
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆51Jun 10, 2021Updated 4 years ago
- ☆18Oct 29, 2021Updated 4 years ago
- The repository for the official Biased Action Recognition (BAR) dataset for the paper Learning from Failure: Training Debiased Classifier…☆35Nov 10, 2020Updated 5 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆43Oct 24, 2023Updated 2 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆17Jul 29, 2021Updated 4 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated 9 months ago
- Code for the paper "Partially-Aligned Data-to-Text Generation with Distant Supervision" in EMNLP 2020.☆19Jan 15, 2021Updated 5 years ago
- ☆48Jan 17, 2023Updated 3 years ago
- ☆19Dec 6, 2023Updated 2 years ago
- Source code for the Nature Machine Intelligence paper: When and how convolutional neural networks generalize to out-of-distribution categ…☆24Feb 26, 2022Updated 4 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆61Apr 8, 2024Updated last year
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Jan 10, 2021Updated 5 years ago
- ☆110Sep 20, 2023Updated 2 years ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Aug 22, 2023Updated 2 years ago
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"☆36Oct 25, 2022Updated 3 years ago
- ☆34Mar 13, 2021Updated 4 years ago
- ☆38Feb 8, 2024Updated 2 years ago
- ☆38Jul 13, 2022Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Framework to develop and back-test online strategies for Uniswap v3 liquidity provision.☆11Jun 14, 2023Updated 2 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆40Apr 7, 2022Updated 3 years ago
- Recycling diverse models☆46Jan 18, 2023Updated 3 years ago
- A Picture Management software using MFC☆10Sep 16, 2013Updated 12 years ago
- Use OpenCV to convert a raw bayer image from a sensor to rgb☆12Apr 2, 2011Updated 14 years ago
- ☆36Jun 16, 2021Updated 4 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆43Dec 5, 2021Updated 4 years ago
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆70May 2, 2025Updated 10 months ago
- CaDiCaL + neural glue variable predictions☆10Oct 21, 2020Updated 5 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆10Nov 1, 2019Updated 6 years ago
- Utility functions for weights and biases (wandb).☆11Sep 17, 2024Updated last year
- ☆11Mar 13, 2023Updated 2 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- This is our ARTS test set, an enriched test set to probe Aspect Robustness of ABSA.☆42Jan 16, 2024Updated 2 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year