[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
☆92Dec 18, 2025Updated 5 months ago
Alternatives and similar repositories for MedCalc-Bench
Users who are interested in MedCalc-Bench are comparing it to the repositories listed below.
- REMed: Retrieval-Enhanced Medical prediction model☆23Jan 8, 2025Updated last year
- ☆48Feb 26, 2025Updated last year
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆100Feb 6, 2026Updated 4 months ago
- [NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"☆27Nov 21, 2025Updated 6 months ago
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆119Aug 22, 2024Updated last year
- ☆86Jan 15, 2024Updated 2 years ago
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 5 months ago
- ☆14Aug 9, 2024Updated last year
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)☆95Feb 6, 2026Updated 4 months ago
- A Python Natural Language Processing Toolkit for Electronic Health Record Texts☆13May 24, 2023Updated 3 years ago
- UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge☆13Nov 16, 2023Updated 2 years ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆42Jun 23, 2024Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated 2 years ago
- Official repository of "EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records" (ACL 2024 Fi…☆17Jul 5, 2024Updated last year
- Biomedical Question Answering Datasets.☆130Apr 30, 2025Updated last year
- Official repository of the MIRAGE benchmark☆209Feb 6, 2026Updated 4 months ago
- Code and data for Cell-o1.☆28Sep 19, 2025Updated 8 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆11Jun 18, 2024Updated last year
- A Chinese National Medical Licensing Examination dataset and large language model benchmarks☆90Dec 2, 2023Updated 2 years ago
- PMC-Patients☆110Jun 7, 2024Updated 2 years ago
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆33May 13, 2026Updated last month
- ☆25Nov 27, 2025Updated 6 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆261Mar 24, 2024Updated 2 years ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆96Sep 26, 2024Updated last year
- Clinically Adapted Model Enhanced from LLaMA☆89Sep 1, 2023Updated 2 years ago
- Code for AttentionMeSH☆17Oct 5, 2018Updated 7 years ago
- Simplify openEHR implementation on Java, Groovy and other JDK languages. By www.CaboLabs.com☆18May 2, 2026Updated last month
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)☆24Mar 6, 2025Updated last year
- CMB, A Comprehensive Medical Benchmark in Chinese☆243Mar 27, 2025Updated last year
- Benchmark, Toolbox, and Reflection-based Method for Clinical Agent☆22Nov 6, 2024Updated last year
- ☆25Jan 15, 2024Updated 2 years ago
- Code for the MedRAG toolkit☆568May 8, 2025Updated last year
- A curated collection of cutting-edge research at the intersection of machine learning and healthcare. This repository will be actively ma…☆35May 31, 2026Updated 2 weeks ago
- OpenMedCalc is a free, open-source medical calculation API☆21Dec 24, 2025Updated 5 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated last year
- ☆73Feb 3, 2025Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆37May 2, 2026Updated last month
- [Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆81Mar 10, 2026Updated 3 months ago
- Code repository for a framework for clinical decision-making tasks using the MIMIC-CDM dataset.☆49Feb 7, 2025Updated last year