eric-ai-lab / ProbMed
"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
☆15 Updated last month
Alternatives and similar repositories for ProbMed:
Users who are interested in ProbMed are comparing it to the repositories listed below:
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ☆16 Updated last week
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings). ☆25 Updated 10 months ago
- LLaVA version of RaDialog ☆17 Updated last month
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data ☆41 Updated last year
- ☆27 Updated 11 months ago
- ☆19 Updated last month
- Repository of the paper "Consistency-preserving Visual Question Answering in Medical Imaging" (MICCAI 2022) ☆23 Updated 2 years ago
- [arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆81 Updated 10 months ago
- Official code of IdealGPT ☆34 Updated last year
- Dataset of the paper "On the Compositional Generalization of Multimodal LLMs for Medical Imaging" ☆32 Updated 2 months ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆30 Updated last year
- Official implementation of "Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More" ☆17 Updated last month
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. ☆46 Updated 3 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ☆75 Updated 6 months ago
- Expert-level AI radiology report evaluator ☆21 Updated last week
- ☆44 Updated last month
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models ☆13 Updated last month
- ☆42 Updated last year
- ☆31 Updated last year
- ☆29 Updated 5 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆28 Updated 3 months ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation" ☆18 Updated 2 weeks ago
- Preference Learning for LLaVA ☆41 Updated 4 months ago
- MRGen: Segmentation Data Engine for Underrepresented MRI Modalities ☆17 Updated 2 weeks ago
- BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature ☆49 Updated this week
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆33 Updated last year
- Official PyTorch implementation of the paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 Updated 8 months ago
- Code implementation of RP3D-Diag ☆15 Updated 4 months ago
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision ☆36 Updated this week
- Code and data for the ACL 2024 paper "Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space" ☆13 Updated 8 months ago