AI-in-Health / ClinicBench
Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark" (EMNLP2024)
☆14Updated 2 months ago
Alternatives and similar repositories for ClinicBench:
Users that are interested in ClinicBench are comparing it to the libraries listed below
- Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions" (NeurIPS 2022)☆30Updated 6 months ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆12Updated 5 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆14Updated 8 months ago
- [NeurIPS 2024 D&B] Official code for "EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries"☆29Updated 2 weeks ago
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images, NeurIPS 2023 D&B☆68Updated 6 months ago
- ☆25Updated last year
- Expert-Curated Oncology Reports to Advance Language Model Inference☆27Updated 9 months ago
- Clinical NLP Shared Task @ NAACL'24☆28Updated 3 weeks ago
- ☆57Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆40Updated 2 weeks ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆76Updated 5 months ago
- Code repository to create the MIMIC-CDM Dataset.☆25Updated 3 weeks ago
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22)☆19Updated 2 years ago
- ☆49Updated 2 years ago
- UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge☆12Updated last year
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆14Updated last year
- "Knowledge Graph-based Question Answering with Electronic Health Records", accepted at Machine Learning for Health Care (MLHC) 2021☆32Updated 11 months ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41…☆41Updated last year
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆25Updated 8 months ago
- Biomedical Question Answering Datasets.☆87Updated last year
- [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records☆75Updated 2 months ago
- REMed: Retrieval-Enhanced Medical prediction model☆15Updated 3 weeks ago
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records☆25Updated last month
- ☆22Updated 2 years ago
- Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…☆26Updated last year
- Source implementation and pointer to pre-trained models for Chexpert++, a BERT-based approximation to CheXpert for radiology report label…☆20Updated 4 years ago
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆52Updated 3 months ago
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)☆11Updated this week
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆74Updated last month
- auto icd coding with prompt☆47Updated 8 months ago