SamuelSchmidgall/AgentClinic

SamuelSchmidgall / AgentClinic

Agent benchmark for medical diagnosis

☆339

Alternatives and similar repositories for AgentClinic

Users that are interested in AgentClinic are comparing it to the libraries listed below.

stanfordmlgroup / MedAgentBench
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
☆307 · Nov 21, 2025 · Updated 8 months ago
univanxx / 3mdbench
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
☆24 · Sep 23, 2025 · Updated 10 months ago
mitmedialab / MDAgents
Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
☆288 · Nov 10, 2024 · Updated last year
Wangyixinxin / MMedAgent
Learning to Use Medical Tools with Multi-modal Agent
☆267 · Mar 18, 2026 · Updated 4 months ago
shan23chen / MedBrowseComp
☆43 · May 22, 2025 · Updated last year
yhzhu99 / MedAgentBoard
[NeurIPS 2025] MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks
☆59 · Mar 13, 2026 · Updated 4 months ago
UCSC-VLAA / o1_medical
☆48 · Feb 26, 2025 · Updated last year
wshi83 / EhrAgent
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
☆137 · Dec 26, 2024 · Updated last year
gersteinlab / MedicalAgentsBench
[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
☆83 · Mar 10, 2026 · Updated 4 months ago
gersteinlab / MedAgents
[ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537
☆362 · May 27, 2024 · Updated 2 years ago
FreedomIntelligence / Chain-of-Diagnosis
An interpretable large language model (LLM) for medical diagnosis.
☆164 · Sep 12, 2024 · Updated last year
mims-harvard / TxAgent
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
☆646 · Jul 30, 2025 · Updated 11 months ago
ncbi-nlp / MedCalc-Bench
[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
☆93 · Dec 18, 2025 · Updated 7 months ago
LibertFan / AI_Hospital
AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis
☆196 · Sep 13, 2024 · Updated last year
HanjieChen / ChallengeClinicalQA
Repo for the paper Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
☆50 · Jul 10, 2025 · Updated last year
AlaaLab / ER-Reason
Official Codebase for "ER-Reason: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room"
☆24 · May 5, 2026 · Updated 2 months ago
ncbi-nlp / TrialGPT
Code and data for TrialGPT.
☆165 · Jan 24, 2025 · Updated last year
UCSC-VLAA / MedReason
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
☆280 · Jun 19, 2025 · Updated last year
eth-medical-ai-lab / Med-PRM
[EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards
☆68 · Sep 15, 2025 · Updated 10 months ago
TsinghuaC3I / MedXpertQA
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
☆170 · Jul 17, 2025 · Updated last year
SamuelSchmidgall / Awesome-Surgical-Data-Science
Papers from the intersection of surgery and data science / machine learning
☆18 · Jan 28, 2024 · Updated 2 years ago
WeixiangYAN / ClinicalLab
[NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World
☆141 · Aug 18, 2024 · Updated last year
glee4810 / FHIR-AgentBench
Code and Data for FHIR-AgentBench
☆26 · Dec 15, 2025 · Updated 7 months ago
JarvisUSTC / DoctorAgent-RL
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
☆96 · Jan 23, 2026 · Updated 6 months ago
MAGIC-AI4Med / DiagGym
A virtual clinical environment for self‑evolving LLM diagnostic agents.
☆108 · Feb 12, 2026 · Updated 5 months ago
jinlab-imvr / MedAgent-Pro
[2026 ICLR] The official code for MedAgent_Pro
☆181 · May 12, 2026 · Updated 2 months ago
nec-research / meddxagent
MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis
☆22 · Jun 13, 2025 · Updated last year
MAGIC-AI4Med / MedS-Ins
[npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"
☆79 · May 5, 2025 · Updated last year
UCSB-AI / ProbMed
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25 · May 12, 2026 · Updated 2 months ago
SamuelSchmidgall / GSViT
Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"
☆51 · Apr 19, 2024 · Updated 2 years ago
DATEXIS / AMEGA-benchmark
AMEGA-LLM: Autonomous Medical Evaluation for Guideline Adherence of Large Language Models
☆31 · Jun 10, 2026 · Updated last month
EPFLiGHT / FullyOpenMeditron
We release Open Meditron, a fully open, clinician-audited medical training corpus and evaluation protocol that closes the open-vs-closed …
☆15 · May 15, 2026 · Updated 2 months ago
paulhager / MIMIC-Clinical-Decision-Making-Dataset
Code repository to create the MIMIC-CDM Dataset.
☆48 · Feb 7, 2025 · Updated last year
kevinwu23 / Stanford-MedCaseReasoning
☆51 · Jun 2, 2025 · Updated last year
UCSC-VLAA / MedTrinity-25M
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…
☆413 · Jul 11, 2025 · Updated last year
bowang-lab / MedRAX
MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025
☆1,201 · Oct 31, 2025 · Updated 8 months ago
FreedomIntelligence / Awesome-LLM-Patient-Simulators
A Paper collection for LLM based Patient Simulators
☆127 · Jan 7, 2026 · Updated 6 months ago
thu-unicorn / Doctor-R1
This is the official repository for our paper "Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning" pu…
☆51 · Apr 11, 2026 · Updated 3 months ago
BlueZeros / AgentEHR
Agentic System, Tool Use, Electronic Health Record, Large Language Models, Clinical Natural Language Processing
☆24 · Apr 13, 2026 · Updated 3 months ago