eric-ai-lab / ProbMed
"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
☆16 · Updated 2 months ago
Alternatives and similar repositories for ProbMed
Users who are interested in ProbMed are comparing it to the repositories listed below:
- [CVPR 2025] MicroVQA eval and RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ☆20 · Updated last month
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging ☆32 · Updated 3 months ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data ☆42 · Updated last year
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation" ☆20 · Updated this week
- ☆19 · Updated last year
- [arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆82 · Updated 11 months ago
- MRGen: Segmentation Data Engine for Underrepresented MRI Modalities ☆18 · Updated last month
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings). ☆27 · Updated 11 months ago
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision ☆40 · Updated last month
- ☆29 · Updated 6 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆28 · Updated last week
- Official PyTorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 · Updated 9 months ago
- Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos" ☆25 · Updated 7 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models ☆24 · Updated last week
- ☆48 · Updated last month
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More ☆17 · Updated 2 months ago
- ☆32 · Updated last year
- ☆11 · Updated 2 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC… ☆16 · Updated 2 months ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆13 · Updated last year
- ☆45 · Updated 11 months ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆30 · Updated last year
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022) ☆23 · Updated 2 years ago
- ☆18 · Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202… ☆25 · Updated last month
- ☆31 · Updated 3 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space' ☆13 · Updated 9 months ago
- ☆20 · Updated 2 months ago
- Official Code of IdealGPT ☆35 · Updated last year
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ☆75 · Updated 7 months ago