The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
☆25Feb 19, 2026Updated 2 months ago
Alternatives and similar repositories for MediConfusion
Users that are interested in MediConfusion are comparing it to the libraries listed below.
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports☆14Nov 22, 2022Updated 3 years ago
- ☆17Jul 20, 2025Updated 9 months ago
- A generalist foundation model for healthcare capable of handling diverse medical data modalities.☆97Apr 30, 2026Updated last week
- Code to Implement the Smooth Euler Characteristic Transform (SECT)☆12Oct 22, 2019Updated 6 years ago
- ☆70Jul 2, 2025Updated 10 months ago
- MC-CoT implementation code☆22Jun 24, 2025Updated 10 months ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen☆19Jan 20, 2025Updated last year
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆19Feb 25, 2023Updated 3 years ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models☆120Jul 7, 2025Updated 10 months ago
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆12May 8, 2024Updated 2 years ago
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆45Jun 29, 2025Updated 10 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆47Dec 27, 2025Updated 4 months ago
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 3 months ago
- A framework for Longitudinal Radiology Report Generation☆29Aug 10, 2024Updated last year
- The implementation of AICircuit: A Multi-Level Dataset and Benchmark for AI-Driven Analog Integrated Circuit Design☆90Jan 28, 2025Updated last year
- Chest X-Ray Explainer (ChEX)☆24Jan 30, 2025Updated last year
- KAIST medical VL research group☆20Dec 20, 2024Updated last year
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆56Jan 22, 2026Updated 3 months ago
- FFA Synthesis from CFP (ACM MM 2024 Workshop Best Paper Award)☆26Dec 13, 2024Updated last year
- ☆38Dec 8, 2025Updated 5 months ago
- Official implementation of paper "HiAE: A High-Throughput Authenticated Encryption Algorithm for Cross-Platform Efficiency"☆19Nov 11, 2025Updated 5 months ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆80Dec 4, 2024Updated last year
- 📖Curated list about reasoning ability of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆16Apr 15, 2026Updated 3 weeks ago
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆155Jul 17, 2025Updated 9 months ago
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆88Apr 13, 2026Updated 3 weeks ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆46Oct 18, 2025Updated 6 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 3 months ago
- This repository demonstrates the utilization of UNETR for brain tumor segmentation.☆11Feb 23, 2024Updated 2 years ago
- code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering☆29May 30, 2025Updated 11 months ago
- ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field☆188Oct 9, 2025Updated 7 months ago
- Code for NeurIPS 2024 paper — Cross-Device Collaborative Test-Time Adaptation☆16Feb 28, 2025Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 10 months ago
- A Multitask Conversational Vision-Language Model for Radiology☆17Jul 3, 2025Updated 10 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆73Jun 5, 2025Updated 11 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆25Jan 31, 2025Updated last year
- [MICCAI 2024] RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features☆10Aug 22, 2025Updated 8 months ago
- (AAAI-2025 oral) LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts☆54Jun 12, 2025Updated 10 months ago