TIO-IKIM / CLUELinks

CLUE: A Clinical Language Understanding Evaluation for LLMs

☆18

Alternatives and similar repositories for CLUE

Users that are interested in CLUE are comparing it to the libraries listed below

Sorting:

starmpcc / REMed
REMed: Retrieval-Enhanced Medical prediction model
☆19Updated 5 months ago
Google-Health / med-gemini-medqa-relabelling
For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.
☆49Updated last year
abachaa / MEDEC
☆37Updated 3 weeks ago
DATEXIS / AMEGA-benchmark
AMEGA-LLM: Autonomous Medical Evaluation for Guideline Adherence of Large Language Models
☆18Updated last month
HanjieChen / ChallengeClinicalQA
Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
☆38Updated 10 months ago
starmpcc / Asclepius
Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"
☆106Updated 10 months ago
zzachw / llemr
NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records
☆36Updated 6 months ago
google-deepmind / codoc
☆115Updated last year
StanfordMIMI / clin-summ
Clinical text summarization by adapting large language models
☆143Updated 10 months ago
stanfordmlgroup / MedAgentBench
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
☆84Updated 4 months ago
glee4810 / ehrsql-2024
Clinical NLP Shared Task @ NAACL'24
☆33Updated last week
som-shahlab / medalign
MedAlign is a clinician-generated dataset for instruction following with electronic medical records.
☆94Updated last month
som-shahlab / ehrshot-benchmark
A benchmark for few-shot evaluation of foundation models for electronic health records (EHRs)
☆176Updated 3 weeks ago
paulhager / MIMIC-Clinical-Decision-Making-Framework
Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset.
☆38Updated 4 months ago
dustn1259 / EHRCon
Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
☆22Updated 10 months ago
JoakimEdin / explainable-medical-coding
☆17Updated last month
MadhumitaSushil / OncLLMExtraction
Expert-Curated Oncology Reports to Advance Language Model Inference
☆29Updated last year
XZhang97666 / AlpaCare
☆91Updated 4 months ago
mila-iqia / ddxplus
☆85Updated 2 years ago
som-shahlab / INSPECT_public
INSPECT dataset/benchmark paper, accepted by NeurIPS 2023
☆33Updated last month
rajpurkarlab / BenchMD
☆81Updated 2 years ago
nyuolab / NYUTron
public code repository for paper "Health system scale language models are general purpose clinical prediction engines"
☆113Updated last year
starmpcc / CAMEL
Clinically Adapted Model Enhanced from LLaMA
☆84Updated last year
ipolharvard / ethos-paper
☆72Updated 3 months ago
som-shahlab / hf_ehr
Training HuggingFace models on EHR data
☆26Updated 3 weeks ago
hiesingerlab / almanac-retrieval
Almanac: Retrieval-Augmented Language Models for Clinical Medicine
☆33Updated last year
wshi83 / EhrAgent
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
☆100Updated 6 months ago
NYUMedML / headCT_foundation
Foundation 3D ViT model for volumetric head CT
☆40Updated 2 months ago
mauro-nievoff / MultiCaRe_Dataset
Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created.
☆43Updated 3 months ago
mlcommons / medperf
An open benchmarking platform for medical artificial intelligence using Federated Evaluation.
☆158Updated this week