[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning
☆22Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for EvALign-ICL
Users that are interested in EvALign-ICL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated 11 months ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆52Jun 10, 2021Updated 4 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆44Oct 24, 2023Updated 2 years ago
- The repository for the official Biased Action Recognition (BAR) dataset for the paper Learning from Failure: Training Debiased Classifier…☆35Nov 10, 2020Updated 5 years ago
- ☆18Oct 29, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆35Aug 30, 2021Updated 4 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"☆36Oct 25, 2022Updated 3 years ago
- ☆48Jan 17, 2023Updated 3 years ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models".☆17May 15, 2023Updated 2 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 9 months ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Sep 25, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆47Feb 19, 2026Updated 2 months ago
- ☆110Sep 20, 2023Updated 2 years ago
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated 2 years ago
- ☆19Dec 6, 2023Updated 2 years ago
- ☆18May 25, 2022Updated 3 years ago
- ☆35Mar 13, 2021Updated 5 years ago
- ☆144Oct 2, 2020Updated 5 years ago
- ☆14Apr 28, 2023Updated 3 years ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆20May 10, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆294Jun 7, 2023Updated 2 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆16Jul 29, 2021Updated 4 years ago
- ☆25Jun 22, 2023Updated 2 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- Recycling diverse models☆46Jan 18, 2023Updated 3 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆80Nov 20, 2025Updated 5 months ago
- ☆19May 31, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated 3 weeks ago
- Ant design Integration with django.☆11Dec 9, 2022Updated 3 years ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Aug 22, 2023Updated 2 years ago
- ☆12Jul 17, 2023Updated 2 years ago
- visual question answering prompting recipes for large vision-language models☆28Sep 14, 2024Updated last year
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆61Apr 8, 2024Updated 2 years ago