Agent benchmark for medical diagnosis
☆312Dec 31, 2024Updated last year
Alternatives and similar repositories for AgentClinic
Users that are interested in AgentClinic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆48Apr 19, 2024Updated 2 years ago
- ☆48Feb 26, 2025Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆268Nov 10, 2024Updated last year
- [Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆80Mar 10, 2026Updated 2 months ago
- High-performance GPU-based simulation platform for reinforcement learning with surgical robot learning☆102Jun 20, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆41May 22, 2025Updated last year
- An interpretable large language model (LLM) for medical diagnosis.☆163Sep 12, 2024Updated last year
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…☆26May 12, 2026Updated 2 weeks ago
- A Graph RAG System for Evidenced-based Medical Information Retrieval [ACL 2025]☆782Oct 18, 2025Updated 7 months ago
- [EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation☆19Dec 11, 2024Updated last year
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆163Apr 7, 2026Updated last month
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆44Jun 4, 2025Updated 11 months ago
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction☆57Oct 27, 2025Updated 7 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆88Dec 18, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated last year
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆48Jul 10, 2025Updated 10 months ago
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆270Nov 21, 2025Updated 6 months ago
- The official codes for "M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging"☆44Jul 28, 2025Updated 9 months ago
- Learning to Use Medical Tools with Multi-modal Agent☆255Mar 18, 2026Updated 2 months ago
- A list of papers that I liked.☆19Jul 8, 2022Updated 3 years ago
- A Paper collection for LLM based Patient Simulators☆115Jan 7, 2026Updated 4 months ago
- ☆47Nov 12, 2025Updated 6 months ago
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning☆28Jun 12, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆28Feb 7, 2024Updated 2 years ago
- ☆21Aug 9, 2024Updated last year
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆345May 27, 2024Updated 2 years ago
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.☆2,196Jun 4, 2025Updated 11 months ago
- ☆17Sep 23, 2024Updated last year
- Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICL…☆32May 12, 2025Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆119Aug 22, 2024Updated last year
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"☆47Apr 24, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning☆31Jun 8, 2023Updated 2 years ago
- Medical o1, Towards medical complex reasoning with LLMs☆1,316Jan 20, 2025Updated last year
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆67Sep 15, 2025Updated 8 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆272Jun 19, 2025Updated 11 months ago
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆135Aug 18, 2024Updated last year
- [NeurIPS 2025] PanTS: The Pancreatic Tumor Segmentation Dataset. PanTS enables development and external evaluation of AI for pancreatic t…☆110May 4, 2026Updated 3 weeks ago
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago