MShahabSepehri / MediConfusionLinks
The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
☆16Updated 3 months ago
Alternatives and similar repositories for MediConfusion
Users that are interested in MediConfusion are comparing it to the libraries listed below
Sorting:
- Chest X-Ray Explainer (ChEX)☆19Updated 4 months ago
- [EMNLP, Findings 2024] a radiology report generation metric that leverages the natural language understanding of language models to ident…☆47Updated last month
- ☆21Updated last year
- A Python tool to evaluate the performance of VLM on the medical domain.☆66Updated last month
- ☆43Updated 3 months ago
- Official code for the CHIL 2024 paper: "Vision-Language Generative Model for View-Specific Chest X-ray Generation"☆51Updated 6 months ago
- The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"☆25Updated 4 months ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆31Updated last month
- ☆34Updated last year
- ☆78Updated last year
- Radiology Language Evaluations☆10Updated last year
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".