The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
☆25Feb 19, 2026Updated 4 months ago
Alternatives and similar repositories for MediConfusion
Users that are interested in MediConfusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports☆14Nov 22, 2022Updated 3 years ago
- Code to Implement the Smooth Euler Characteristic Transform (SECT)☆12Oct 22, 2019Updated 6 years ago
- A generalist foundation model for healthcare capable of handling diverse medical data modalities.☆98Apr 30, 2026Updated last month
- ☆25Nov 27, 2025Updated 6 months ago
- ☆71Jul 2, 2025Updated 11 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- MC-CoT implementation code☆23Jun 24, 2025Updated 11 months ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen☆18Jan 20, 2025Updated last year
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆19Feb 25, 2023Updated 3 years ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models☆126Jul 7, 2025Updated 11 months ago
- Implementation of the paper "CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Ima…☆21Jul 2, 2024Updated last year
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆13May 8, 2024Updated 2 years ago
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆48Jun 29, 2025Updated 11 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆47Dec 27, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 4 months ago
- The implementation of AICircuit: A Multi-Level Dataset and Benchmark for AI-Driven Analog Integrated Circuit Design☆96Jan 28, 2025Updated last year
- Chest X-Ray Explainer (ChEX)☆24Jan 30, 2025Updated last year
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆58Jan 22, 2026Updated 4 months ago
- [MICCAI'24] Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation☆26Apr 14, 2025Updated last year
- FFA Synthesis from CFP (ACM MM 2024 Workshop Best Paper Award)☆29Dec 13, 2024Updated last year
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆81Dec 4, 2024Updated last year
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆16Apr 15, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆94Apr 13, 2026Updated 2 months ago
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆161Jul 17, 2025Updated 11 months ago
- This repository demonstrates the utilization of UNETR for brain tumor segmentation.☆11Feb 23, 2024Updated 2 years ago
- This repository contains the pretraining code for the Pillar-0 model.☆36Jan 13, 2026Updated 5 months ago
- ☆39Mar 19, 2026Updated 3 months ago
- code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering☆29May 30, 2025Updated last year
- ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field☆189Oct 9, 2025Updated 8 months ago
- MSWAL☆15Nov 7, 2025Updated 7 months ago
- Developing VLMs for expert-level performance in specific medical specialties☆25Apr 25, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Code for NeurIPS 2024 paper — Cross-Device Collaborative Test-Time Adaptation☆17Feb 28, 2025Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 11 months ago
- A Multitask Conversational Vision-Language Model for Radiology☆17Jul 3, 2025Updated 11 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆75Jun 5, 2025Updated last year
- Multiclass Segmentation using UNET on Crowd Instance-level Human Parsing (CHIP) dataset☆16Oct 28, 2022Updated 3 years ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆43May 20, 2025Updated last year