wshi83/MedAgentGym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wshi83/MedAgentGym)

wshi83 / MedAgentGym

[ICLR'26] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale

☆124

Alternatives and similar repositories for MedAgentGym

Users that are interested in MedAgentGym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
ritaranx / ClinGen
View on GitHub
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…
☆43Jun 23, 2024Updated 2 years ago
wshi83 / MedAdapter
View on GitHub
[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning
☆37Dec 26, 2024Updated last year
ritaranx / RAM-EHR
View on GitHub
[ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.
☆42Sep 19, 2024Updated last year
jerry3027 / PolyIE
View on GitHub
☆17Jan 26, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wshi83 / EhrAgent
View on GitHub
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
☆137Dec 26, 2024Updated last year
PKU-AICare / ColaCare
View on GitHub
[WWW-2025] ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration
☆33Oct 11, 2025Updated 9 months ago
MAGIC-AI4Med / DiagGym
View on GitHub
A virtual clinical environment for self‑evolving LLM diagnostic agents.
☆108Feb 12, 2026Updated 5 months ago
alibaba-damo-academy / ReasonMed
View on GitHub
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
☆121Oct 28, 2025Updated 8 months ago
MAGIC-AI4Med / RadABench
View on GitHub
The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"
☆29Jan 22, 2025Updated last year
ritaranx / AceSearcher
View on GitHub
This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…
☆25Sep 29, 2025Updated 9 months ago
ritaranx / NeST
View on GitHub
[AAAI 2023] This is the code for our paper `Neighborhood-Regularized Self-Training for Learning with Few Labels'.
☆12Jan 11, 2023Updated 3 years ago
ritaranx / CACHE
View on GitHub
[ML4H 2022] This is the code for our paper `Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions …
☆27Feb 6, 2024Updated 2 years ago
gersteinlab / MedicalAgentsBench
View on GitHub
[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
☆83Mar 10, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
XinhengLyu / WSI-Agents
View on GitHub
☆49Apr 28, 2026Updated 2 months ago
LijunRio / AG-KD
View on GitHub
This repository contains the code for our paper: Enhancing Abnormality Grounding for Vision-Language Models with Knowledge Descriptions
☆19Jun 24, 2025Updated last year
night-chen / HYDRA
View on GitHub
Official Code Repository for paper "HYDRA: Model Factorization Framework for Black-Box LLM Personalization"
☆16Oct 7, 2024Updated last year
MAGIC-AI4Med / DeepRare
View on GitHub
Code implementation of DeepRare (Nature 2026)
☆271Apr 14, 2026Updated 3 months ago
scott-yjyang / MeWM
View on GitHub
[ICCV 2025] Medical World Model
☆163Updated this week
ljwztc / MedChain
View on GitHub
The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"
☆55Apr 8, 2026Updated 3 months ago
SII-zyj / Ophiuchus
View on GitHub
☆26May 15, 2026Updated 2 months ago
hetergraphforbankruptcypredict / HAT
View on GitHub
heterogeneous graph attention network for SMEs bankruptcy prediction
☆13Feb 26, 2021Updated 5 years ago
dustn1259 / EHRCon
View on GitHub
Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
☆31Aug 21, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
UCSC-VLAA / MedReason
View on GitHub
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
☆280Jun 19, 2025Updated last year
alibaba-damo-academy / MedEvalKit
View on GitHub
MedEvalKit: A Unified Medical Evaluation Framework
☆247Feb 24, 2026Updated 5 months ago
Graph-and-Geometric-Learning / STFlow
View on GitHub
Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching, ICML2025 (Spotlight)
☆36Aug 11, 2025Updated 11 months ago
WeixiangYAN / ClinicalLab
View on GitHub
[NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World
☆141Aug 18, 2024Updated last year
nec-research / meddxagent
View on GitHub
MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis
☆22Jun 13, 2025Updated last year
GregxmHu / OccuBench
View on GitHub
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
☆21Apr 14, 2026Updated 3 months ago
UCSC-VLAA / m1
View on GitHub
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
☆51Dec 21, 2025Updated 7 months ago
PKU-AICare / ConfAgents
View on GitHub
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis
☆15Updated this week
FreedomIntelligence / Med-MAT
View on GitHub
[ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging
☆40Jun 4, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
FreedomIntelligence / SepsisAgent
View on GitHub
Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model
☆30May 15, 2026Updated 2 months ago
MrGiovanni / CARE
View on GitHub
[NeurIPS 2025] Completeness-Aware Reconstruction Enhancement
☆37Oct 18, 2025Updated 9 months ago
snu-cdrc / gencube
View on GitHub
Efficient retrieval, download, and unification of genomic data from leading biodiversity databases
☆18Updated this week
AgenticHealthAI / Awesome-AI-Agents-for-Healthcare
View on GitHub
Latest Advances on Agentic AI & AI Agents for Healthcare
☆1,180Updated this week
TsinghuaC3I / MedXpertQA
View on GitHub
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
☆170Jul 17, 2025Updated last year
MAGIC-AI4Med / EHR-R1
View on GitHub
☆37May 18, 2026Updated 2 months ago
som-shahlab / medalign
View on GitHub
MedAlign is a clinician-generated dataset for instruction following with electronic medical records.
☆102May 17, 2025Updated last year