Agent benchmark for medical diagnosis
☆322 · Dec 31, 2024 · Updated last year
Alternatives and similar repositories for AgentClinic
Users who are interested in AgentClinic are comparing it to the libraries listed below.
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery" ☆50 · Apr 19, 2024 · Updated 2 years ago
- ☆48 · Feb 26, 2025 · Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making ☆274 · Nov 10, 2024 · Updated last year
- [Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning ☆81 · Mar 10, 2026 · Updated 3 months ago
- ☆42 · May 22, 2025 · Updated last year
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M… ☆26 · May 12, 2026 · Updated last month
- A Graph RAG System for Evidence-based Medical Information Retrieval [ACL 2025] ☆796 · Oct 18, 2025 · Updated 7 months ago
- [EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation ☆19 · Dec 11, 2024 · Updated last year
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted) ☆170 · Apr 7, 2026 · Updated 2 months ago
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery" ☆50 · Jun 4, 2025 · Updated last year
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction ☆57 · Oct 27, 2025 · Updated 7 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations ☆92 · Dec 18, 2025 · Updated 5 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine" ☆78 · May 5, 2025 · Updated last year
- Repo for the paper "Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions" ☆49 · Jul 10, 2025 · Updated 11 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records ☆136 · Dec 26, 2024 · Updated last year
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents ☆280 · Nov 21, 2025 · Updated 6 months ago
- Learning to Use Medical Tools with Multi-modal Agent ☆256 · Mar 18, 2026 · Updated 2 months ago
- PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024) ☆113 · Feb 17, 2026 · Updated 3 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks ☆31 · Oct 28, 2025 · Updated 7 months ago
- Code and data for MedQA ☆384 · Dec 1, 2022 · Updated 3 years ago
- ☆47 · Nov 12, 2025 · Updated 7 months ago
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning ☆28 · Jun 12, 2025 · Updated last year
- ☆28 · Feb 7, 2024 · Updated 2 years ago
- ☆21 · Aug 9, 2024 · Updated last year
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537 ☆348 · May 27, 2024 · Updated 2 years ago
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆13 · Jul 4, 2024 · Updated last year
- ☆17 · Sep 23, 2024 · Updated last year
- Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities. ☆2,209 · Jun 4, 2025 · Updated last year
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations… ☆409 · Jul 11, 2025 · Updated 11 months ago
- Code repository for the framework to engage in clinical decision-making tasks using the MIMIC-CDM dataset. ☆49 · Feb 7, 2025 · Updated last year
- Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICL… ☆32 · May 12, 2025 · Updated last year
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization. ☆21 · Dec 24, 2024 · Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes" ☆119 · Aug 22, 2024 · Updated last year
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation" ☆49 · Apr 24, 2025 · Updated last year
- EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning ☆31 · Jun 8, 2023 · Updated 3 years ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards ☆68 · Sep 15, 2025 · Updated 9 months ago
- Medical o1, Towards medical complex reasoning with LLMs ☆1,331 · Jan 20, 2025 · Updated last year
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs ☆277 · Jun 19, 2025 · Updated 11 months ago
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World ☆138 · Aug 18, 2024 · Updated last year