Agent benchmark for medical diagnosis
☆301Dec 31, 2024Updated last year
Alternatives and similar repositories for AgentClinic
Users that are interested in AgentClinic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆261Nov 10, 2024Updated last year
- AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis☆193Sep 13, 2024Updated last year
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆80Mar 10, 2026Updated last month
- High-performance GPU-based simulation platform for reinforcement learning with surgical robot learning☆102Jun 20, 2025Updated 10 months ago
- ☆41May 22, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…☆26Feb 21, 2025Updated last year
- A Graph RAG System for Evidenced-based Medical Information Retrieval [ACL 2025]☆774Oct 18, 2025Updated 6 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆156Apr 7, 2026Updated 3 weeks ago
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆42Jun 4, 2025Updated 11 months ago
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction☆56Oct 27, 2025Updated 6 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆89Dec 18, 2025Updated 4 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated last year
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆128Dec 26, 2024Updated last year
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆48Jul 10, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆262Nov 21, 2025Updated 5 months ago
- Learning to Use Medical Tools with Multi-modal Agent☆249Mar 18, 2026Updated last month
- PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)☆108Feb 17, 2026Updated 2 months ago
- A list of papers that I liked.☆19Jul 8, 2022Updated 3 years ago
- A Paper collection for LLM based Patient Simulators☆114Jan 7, 2026Updated 4 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆30Oct 28, 2025Updated 6 months ago
- Code and data for MedQA☆376Dec 1, 2022Updated 3 years ago
- ☆47Nov 12, 2025Updated 5 months ago
- ☆28Feb 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Aug 9, 2024Updated last year
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆342May 27, 2024Updated last year
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.☆2,182Jun 4, 2025Updated 11 months ago
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…☆410Jul 11, 2025Updated 9 months ago
- Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.☆49Feb 7, 2025Updated last year
- Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICL…☆31May 12, 2025Updated 11 months ago
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆118Aug 22, 2024Updated last year
- [NeurIPS 2025] PanTS: The Pancreatic Tumor Segmentation Dataset. PanTS enables development and external evaluation of AI for pancreatic t…☆104Mar 20, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning☆31Jun 8, 2023Updated 2 years ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆66Sep 15, 2025Updated 7 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆267Jun 19, 2025Updated 10 months ago
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆135Aug 18, 2024Updated last year
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 5 months ago
- Constructing community of LLM-based Agent in the minecraft☆17Nov 27, 2025Updated 5 months ago