eric-ai-lab / ProbMed
"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ProbMed
- ViLLA: Fine-grained vision-language representation learning from real-world data☆40Updated last year
- ☆22Updated 6 months ago
- ☆13Updated this week
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆24Updated 6 months ago
- ☆24Updated last month
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆15Updated last week
- [NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆56Updated last month
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆74Updated 6 months ago
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆30Updated last month
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆21Updated last year
- ☆11Updated 2 months ago
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models☆40Updated last month
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆16Updated 5 months ago
- ☆32Updated this week
- NeurIPS 2024 (spotlight): A Textbook Remedy for Domain Shifts Knowledge Priors for Medical Image Analysis☆20Updated last month
- ☆36Updated last month
- ☆31Updated last month
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding☆30Updated last week
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Updated last year
- Code for paper: VL-ICL Bench: The Devil in the Details of Benchmarking Multimodal In-Context Learning☆29Updated 7 months ago
- Chest X-Ray Explainer (ChEX)☆10Updated 3 months ago
- ☆13Updated last month
- ☆13Updated 3 weeks ago
- The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆41Updated last week
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆28Updated 8 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆11Updated 4 months ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆11Updated 9 months ago
- ☆19Updated last year
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆49Updated last month
- Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.☆52Updated 2 months ago