ncbi-nlp/MedCalc-Bench

[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations

☆93

Alternatives and similar repositories for MedCalc-Bench

Users that are interested in MedCalc-Bench are comparing it to the libraries listed below.

shan23chen / Cross-Care
Cross-Care
☆11 · Jun 24, 2024 · Updated 2 years ago

seonhee99 / EHR-SeqSQL
Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi…
☆17 · Jul 5, 2024 · Updated 2 years ago

starmpcc / REMed
REMed: Retrieval-Enhanced Medical prediction model
☆24 · Jan 8, 2025 · Updated last year

UCSC-VLAA / o1_medical
☆48 · Feb 26, 2025 · Updated last year

baeseongsu / mimic-cxr-vqa
A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…
☆100 · Feb 6, 2026 · Updated 5 months ago

MediaBrain-SJTU / GenMedicalEval
☆86 · Jan 15, 2024 · Updated 2 years ago

mmcdermott / AUC_is_all_you_need
Analyzing different ML model comparison metrics
☆17 · Jan 20, 2024 · Updated 2 years ago

tdlhl / RAD
[NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"
☆27 · Nov 21, 2025 · Updated 8 months ago

starmpcc / Asclepius
Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"
☆121 · Aug 22, 2024 · Updated last year

Itaymanes / K-QA
Dataset and Evaluation Code for the K-QA Benchmark.
☆18 · May 26, 2024 · Updated 2 years ago

alexgoodell / open-med-calc
OpenMedCalc is a free, open-source medical calculation API
☆23 · Dec 24, 2025 · Updated 7 months ago

baeseongsu / Clinical-LLM-FineTuning-HandsOn
Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials
☆17 · Jul 10, 2026 · Updated 2 weeks ago

Jwoo5 / integrated-ehr-pipeline
☆14 · Aug 9, 2024 · Updated last year

baeseongsu / ehrxqa
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)
☆98 · Feb 6, 2026 · Updated 5 months ago

zzachw / llemr
NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records
☆61 · Jul 17, 2026 · Updated last week

paulhager / MIMIC-Clinical-Decision-Making-Framework
Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.
☆49 · Feb 7, 2025 · Updated last year

WeixiangYAN / ClinicalLab
[NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World
☆141 · Aug 18, 2024 · Updated last year

ncbi-nlp / Clinical-Tool-Learning
☆27 · Aug 10, 2025 · Updated 11 months ago

gzxiong / MIRAGE
Official repository of the MIRAGE benchmark
☆209 · Feb 6, 2026 · Updated 5 months ago

IreneZihuiLi / EHRKit-2022
A Python Natural Language Processing Toolkit for Electronic Health Record Texts
☆13 · May 24, 2023 · Updated 3 years ago

hoon9405 / UniHPF
UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge
☆13 · Nov 16, 2023 · Updated 2 years ago

thomaswei-cn / MC-CoT
MC-CoT implementation code
☆23 · Jun 24, 2025 · Updated last year

ritaranx / ClinGen
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…
☆43 · Jun 23, 2024 · Updated 2 years ago

yiqingxyq / DocLens
Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)
☆22 · May 18, 2024 · Updated 2 years ago

abachaa / Existing-Medical-QA-Datasets
Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems
☆317 · Oct 17, 2023 · Updated 2 years ago

drqiaojin / biomedical-qa-datasets
Biomedical Question Answering Datasets.
☆131 · Apr 30, 2025 · Updated last year

pmc-patients / pmc-patients
PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems…
☆83 · Dec 20, 2023 · Updated 2 years ago

ncbi-nlp / cell-o1
Code and data for Cell-o1.
☆29 · Updated this week

AQ-MedAI / PulseMind
☆20 · Jan 28, 2026 · Updated 5 months ago

lapisrocks / DiscreteAdversarialDistillation
[NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"
☆11 · Jun 18, 2024 · Updated 2 years ago

zhao-zy15 / PMC-Patients
PMC-Patients
☆113 · Jun 7, 2024 · Updated 2 years ago

gersteinlab / MedicalAgentsBench
[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
☆83 · Mar 10, 2026 · Updated 4 months ago

XYPB / CLEFT
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18 · Feb 12, 2025 · Updated last year

mingze-yuan / Awesome-LLM-Healthcare
The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…
☆269 · Dec 23, 2023 · Updated 2 years ago

CUHK-AIM-Group / MCPL
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13 · Apr 17, 2024 · Updated 2 years ago

abachaa / MeQSum
Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…
☆33 · May 13, 2026 · Updated 2 months ago

zhao-zy15 / RareArena
A Comprehensive Rare Disease Diagnostic Dataset with nearly 50,000 patients covering more than 4000 diseases
☆49 · Mar 13, 2026 · Updated 4 months ago

RyanWangZf / PromptEHR
EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning
☆31 · Jun 8, 2023 · Updated 3 years ago

ncbi / MedCPT
Code for MedCPT, a model for zero-shot biomedical information retrieval.
☆270 · Mar 24, 2024 · Updated 2 years ago