eric-ai-lab / ProbMed
"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
☆17Updated 2 months ago
Alternatives and similar repositories for ProbMed
Users that are interested in ProbMed are comparing it to the libraries listed below
Sorting:
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆40Updated last week
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆44Updated last year
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆20Updated last month
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆26Updated last month
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging☆32Updated 4 months ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆21Updated 3 weeks ago
- ☆30Updated 6 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆13Updated 9 months ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆27Updated last year
- MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆18Updated 2 months ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆83Updated last year
- ☆32Updated 3 months ago
- LLaVa Version of RaDialog☆20Updated last month
- Expert-level AI radiology report evaluator☆29Updated last month
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆58Updated last month
- ☆48Updated 2 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆30Updated 3 weeks ago
- ☆18Updated last year
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆13Updated 5 months ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Updated 10 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- ☆20Updated 3 months ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆31Updated 7 months ago
- ☆45Updated 4 months ago
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision☆41Updated last month
- The official code for MedAgent_Pro☆24Updated 3 weeks ago
- ☆33Updated last year
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆13Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆27Updated 3 weeks ago