The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
☆25Feb 19, 2026Updated 3 months ago
Alternatives and similar repositories for MediConfusion
Users that are interested in MediConfusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)☆95Feb 6, 2026Updated 3 months ago
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports☆14Nov 22, 2022Updated 3 years ago
- ☆18Jul 20, 2025Updated 10 months ago
- Code to Implement the Smooth Euler Characteristic Transform (SECT)☆12Oct 22, 2019Updated 6 years ago
- A generalist foundation model for healthcare capable of handling diverse medical data modalities.☆98Apr 30, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆70Jul 2, 2025Updated 10 months ago
- MC-CoT implementation code☆23Jun 24, 2025Updated 11 months ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen☆18Jan 20, 2025Updated last year
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆19Feb 25, 2023Updated 3 years ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models☆122Jul 7, 2025Updated 10 months ago
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆12May 8, 2024Updated 2 years ago
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆46Jun 29, 2025Updated 11 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆47Dec 27, 2025Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 3 months ago
- A framework for Longitudinal Radiology Report Generation☆29Aug 10, 2024Updated last year
- Chest X-Ray Explainer (ChEX)☆24Jan 30, 2025Updated last year
- KAIST medical VL research group☆20Dec 20, 2024Updated last year
- ☆71Feb 3, 2025Updated last year
- FFA Synthesis from CFP (ACM MM 2024 Workshop Best Paper Award)☆27Dec 13, 2024Updated last year
- [MICCAI'24] Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation☆26Apr 14, 2025Updated last year
- ☆38Dec 8, 2025Updated 5 months ago
- Official implementation of paper "HiAE: A High-Throughput Authenticated Encryption Algorithm for Cross-Platfor Efficiency"☆19Nov 11, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆81Dec 4, 2024Updated last year
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆16Apr 15, 2026Updated last month
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆90Apr 13, 2026Updated last month
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆158Jul 17, 2025Updated 10 months ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆45Oct 18, 2025Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 3 months ago
- This repository demonstrates the utilization of UNETR for brain tumor segmentation.☆11Feb 23, 2024Updated 2 years ago
- Official code for "LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation"☆142Nov 11, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Nov 5, 2024Updated last year
- This repository contains the pretraining code for the Pillar-0 model.☆34Jan 13, 2026Updated 4 months ago
- ☆39Mar 19, 2026Updated 2 months ago
- code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering☆29May 30, 2025Updated last year
- ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field☆189Oct 9, 2025Updated 7 months ago
- MSWAL☆14Nov 7, 2025Updated 6 months ago
- Developing VLMs for expert-level performance in specific medical specialties☆25Apr 25, 2025Updated last year