TsinghuaC3I / MedXpertQA
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
☆142 · Jul 17, 2025 · Updated 6 months ago
Alternatives and similar repositories for MedXpertQA
Users who are interested in MedXpertQA compare it to the repositories listed below.
- ☆21 · Nov 27, 2025 · Updated 2 months ago
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B) ☆91 · Feb 6, 2026 · Updated last week
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks ☆45 · Oct 18, 2025 · Updated 3 months ago
- A generalist foundation model for healthcare capable of handling diverse medical data modalities. ☆92 · Apr 25, 2024 · Updated last year
- MedEvalKit: A Unified Medical Evaluation Framework ☆210 · Oct 23, 2025 · Updated 3 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards ☆60 · Sep 15, 2025 · Updated 5 months ago
- ☆25 · Feb 6, 2026 · Updated last week
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging ☆38 · Jun 4, 2025 · Updated 8 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1. ☆26 · Apr 24, 2025 · Updated 9 months ago
- [ICLR'25] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models ☆301 · Jan 22, 2025 · Updated last year
- The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?" ☆28 · Jan 22, 2025 · Updated last year
- ☆69 · Feb 3, 2025 · Updated last year
- ☆48 · Feb 26, 2025 · Updated 11 months ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr… ☆96 · Feb 6, 2026 · Updated last week
- [NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine ☆28 · Mar 10, 2025 · Updated 11 months ago
- [ICCV'25 Highlight] Derm1M: A Million‑Scale Vision‑Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology ☆59 · Dec 5, 2025 · Updated 2 months ago
- A Python tool to evaluate the performance of VLM on the medical domain. ☆83 · Aug 5, 2025 · Updated 6 months ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine' ☆34 · Nov 5, 2024 · Updated last year
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA" ☆25 · Feb 21, 2025 · Updated 11 months ago
- ☆20 · Jan 3, 2025 · Updated last year
- The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical found… ☆24 · Nov 21, 2025 · Updated 2 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models ☆421 · Apr 13, 2025 · Updated 10 months ago
- Medical Multimodal LLMs ☆372 · Apr 23, 2025 · Updated 9 months ago
- ☆11 · Sep 16, 2021 · Updated 4 years ago
- Repo for the paper "Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions" ☆48 · Jul 10, 2025 · Updated 7 months ago
- [ACM MM 2025 🔥🔥] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex… ☆18 · Aug 28, 2025 · Updated 5 months ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models ☆104 · Jul 7, 2025 · Updated 7 months ago
- ☆25 · Jan 11, 2025 · Updated last year
- This is the repository of Quality Sentinel, a label quality evaluation model for medical image segmentation. ☆22 · Dec 3, 2025 · Updated 2 months ago
- [ACMMM-2022] This is the official implementation of Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Know… ☆38 · Dec 14, 2022 · Updated 3 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,… ☆27 · Jan 10, 2026 · Updated last month
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI. ☆85 · Jun 4, 2025 · Updated 8 months ago
- ☆67 · Oct 31, 2025 · Updated 3 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization ☆67 · Jun 5, 2025 · Updated 8 months ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025] ☆42 · Jun 29, 2025 · Updated 7 months ago
- SAM-Med2D: Bridging the Gap between Natural Image Segmentation and Medical Image Segmentation ☆75 · Nov 19, 2023 · Updated 2 years ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs ☆255 · Jun 19, 2025 · Updated 7 months ago
- ☆41 · Jan 28, 2026 · Updated 2 weeks ago
- ☆43 · Oct 20, 2023 · Updated 2 years ago