TsinghuaC3I/MedXpertQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TsinghuaC3I/MedXpertQA)

TsinghuaC3I / MedXpertQA

[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

☆171

Alternatives and similar repositories for MedXpertQA

Users that are interested in MedXpertQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uni-medical / GMAI-VL-R1
View on GitHub
☆19Jul 21, 2025Updated last year
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 8 months ago
zzma2 / medical-llm-reasoning-survey
View on GitHub
A curated list of medical reasoning research on large language models, organized by modality, technique, application, and benchmark.
☆19Oct 17, 2025Updated 9 months ago
gersteinlab / MedicalAgentsBench
View on GitHub
[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
☆82Mar 10, 2026Updated 4 months ago
UCSC-VLAA / MedReason
View on GitHub
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
☆280Jun 19, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
haojinw0027 / MedFrameQA
View on GitHub
MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning
☆18Jun 6, 2025Updated last year
sunanhe / MedDr
View on GitHub
A generalist foundation model for healthcare capable of handling diverse medical data modalities.
☆100Apr 30, 2026Updated 2 months ago
liujiyaoFDU / MedQ-Bench
View on GitHub
☆30Oct 13, 2025Updated 9 months ago
alibaba-damo-academy / MedEvalKit
View on GitHub
MedEvalKit: A Unified Medical Evaluation Framework
☆247Feb 24, 2026Updated 5 months ago
aiming-lab / MMedPO
View on GitHub
[ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
☆74Jun 5, 2025Updated last year
function2-llx / MMMM
View on GitHub
[NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
☆31Mar 10, 2025Updated last year
FreedomIntelligence / HuatuoGPT-Vision
View on GitHub
Medical Multimodal LLMs
☆398Apr 23, 2025Updated last year
richard-peng-xia / MMed-RAG
View on GitHub
[ICLR'25] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
☆337Jan 22, 2025Updated last year
DDVD233 / QoQ_Med
View on GitHub
☆52Jul 31, 2025Updated 11 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
gzxiong / MedRAG
View on GitHub
Code for the MedRAG toolkit
☆580May 8, 2025Updated last year
baeseongsu / ehrxqa
View on GitHub
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)
☆98Feb 6, 2026Updated 5 months ago
shan23chen / MedBrowseComp
View on GitHub
☆43May 22, 2025Updated last year
HanjieChen / ChallengeClinicalQA
View on GitHub
Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
☆50Jul 10, 2025Updated last year
mitmedialab / MDAgents
View on GitHub
Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
☆288Nov 10, 2024Updated last year
Holipori / MIMIC-Diff-VQA
View on GitHub
☆73Feb 3, 2025Updated last year
baeseongsu / mimic-cxr-vqa
View on GitHub
A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…
☆100Feb 6, 2026Updated 5 months ago
MAGIC-AI4Med / MedRBench
View on GitHub
[Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".
☆70Nov 7, 2025Updated 8 months ago
alibaba-damo-academy / ReasonMed
View on GitHub
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
☆122Oct 28, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
eth-medical-ai-lab / Med-PRM
View on GitHub
[EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards
☆69Sep 15, 2025Updated 10 months ago
MrGiovanni / ScaleMAI
View on GitHub
☆24Jan 11, 2025Updated last year
UCSC-VLAA / o1_medical
View on GitHub
☆48Feb 26, 2025Updated last year
Wangyixinxin / MMedAgent
View on GitHub
Learning to Use Medical Tools with Multi-modal Agent
☆267Mar 18, 2026Updated 4 months ago
UCSC-VLAA / MedTrinity-25M
View on GitHub
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…
☆412Jul 11, 2025Updated last year
Xu-Huihui / MedGround-R1
View on GitHub
Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…
☆42Sep 28, 2025Updated 10 months ago
MrGiovanni / RadGPT
View on GitHub
[ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …
☆215Dec 31, 2025Updated 6 months ago
corentin-ryr / MultiMedEval
View on GitHub
A Python tool to evaluate the performance of VLM on the medical domain.
☆89Aug 5, 2025Updated 11 months ago
Awenbocc / GEMeX-Project
View on GitHub
Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]
☆48Jun 29, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kevinwu23 / Stanford-MedCaseReasoning
View on GitHub
☆51Jun 2, 2025Updated last year
Stanford-AIMI / CheXagent
View on GitHub
[Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation
☆230Jan 7, 2025Updated last year
Google-Health / medsiglip
View on GitHub
☆298Jun 11, 2026Updated last month
Schuture / Quality-Sentinel
View on GitHub
This is the repository of Quality Sentinel, a label quality evaluation model for medical image segmentation.
☆22Dec 3, 2025Updated 7 months ago
baeseongsu / Clinical-LLM-FineTuning-HandsOn
View on GitHub
Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials
☆17Jul 10, 2026Updated 2 weeks ago
BAAI-DCAI / M3D
View on GitHub
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
☆454Apr 13, 2025Updated last year
Yuxiang-Lai117 / Med-R1
View on GitHub
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
☆129Jul 7, 2025Updated last year