Agent benchmark for medical diagnosis
☆280Dec 31, 2024Updated last year
Alternatives and similar repositories for AgentClinic
Users that are interested in AgentClinic are comparing it to the libraries listed below
Sorting:
- ☆48Feb 26, 2025Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆238Nov 10, 2024Updated last year
- ☆41May 22, 2025Updated 9 months ago
- AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis☆186Sep 13, 2024Updated last year
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆76Oct 10, 2025Updated 4 months ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆46Apr 19, 2024Updated last year
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…☆17Dec 11, 2024Updated last year
- A Graph RAG System for Evidenced-based Medical Information Retrieval [ACL 2025]☆737Oct 18, 2025Updated 4 months ago
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆229Nov 21, 2025Updated 3 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆77May 5, 2025Updated 10 months ago
- Constructing community of LLM-based Agent in the minecraft☆16Nov 27, 2025Updated 3 months ago
- Learning to Use Medical Tools with Multi-modal Agent☆230Feb 7, 2026Updated 3 weeks ago
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- ☆15Sep 23, 2024Updated last year
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆33Jun 4, 2025Updated 9 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 4 months ago
- ☆46Nov 12, 2025Updated 3 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆126Dec 26, 2024Updated last year
- Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.☆48Feb 7, 2025Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆114Aug 22, 2024Updated last year
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆28Jun 12, 2025Updated 8 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆130Jan 31, 2026Updated last month
- Medical reasoning using large language models☆92Jan 9, 2024Updated 2 years ago
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning☆32Jun 8, 2023Updated 2 years ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆255Jun 19, 2025Updated 8 months ago
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction☆54Oct 27, 2025Updated 4 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆60Sep 15, 2025Updated 5 months ago
- Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICL…☆27May 12, 2025Updated 9 months ago
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…☆401Jul 11, 2025Updated 7 months ago
- Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.☆2,136Jun 4, 2025Updated 9 months ago
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆324May 27, 2024Updated last year
- MedEvalKit: A Unified Medical Evaluation Framework☆211Feb 24, 2026Updated last week
- The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"☆44Oct 10, 2025Updated 4 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆39Jun 4, 2025Updated 9 months ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Mar 26, 2024Updated last year
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆126Aug 18, 2024Updated last year
- Medical o1, Towards medical complex reasoning with LLMs☆1,284Jan 20, 2025Updated last year
- ☆21Aug 9, 2024Updated last year
- ☆32Oct 18, 2024Updated last year