llmeval/LLMEval-Med

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/llmeval/LLMEval-Med)

llmeval / LLMEval-Med

[EMNLP 2025] A real-world clinical benchmark for medical LLMs with physician validation — 2,996 questions from EHRs

☆28

Alternatives and similar repositories for LLMEval-Med

Users that are interested in LLMEval-Med are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AQ-MedAI / MedicalAiBenchEval
View on GitHub
A comprehensive medical AI evaluation framework based on GAPS methodology. Features automated assessment pipeline, thoracic surgery datas…
☆44Nov 3, 2025Updated 8 months ago
Ananyaiitbhilai / Text2Triple-LLM-Agent
View on GitHub
[ESWC '24] This repo is official implementation for the paper "Towards Harnessing Large Language Models as Autonomous Agents for Semantic…
☆10May 25, 2024Updated 2 years ago
abachaa / MEDEC
View on GitHub
☆48Jul 17, 2026Updated last week
Medical-Event-Data-Standard / MIMIC_IV_MEDS
View on GitHub
The MIMIC-IV MEDS ETL
☆23Updated this week
believewhat / Dr.NoteAid
View on GitHub
ACL Workshop 2023
☆15Jan 3, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
leebird / bionlp17
View on GitHub
Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction
☆11Oct 25, 2017Updated 8 years ago
Joinn99 / RocketEval-ICLR
View on GitHub
🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist
☆17Aug 21, 2025Updated 11 months ago
SPIRAL-MED / DiagnosisArena
View on GitHub
☆33Jun 26, 2026Updated last month
OntoGene / PyBioC
View on GitHub
Python library for working with BioC files
☆13Mar 28, 2018Updated 8 years ago
lbox-kr / kbl
View on GitHub
Korean Benchmark for Korean Legal Language Understanding
☆19Nov 16, 2024Updated last year
WGLab / PhenoGPT
View on GitHub
☆32Mar 15, 2025Updated last year
repozhang / malevolent_dialogue
View on GitHub
MDRDC dataset and used baselines
☆11Feb 20, 2023Updated 3 years ago
multilexsum / dataset
View on GitHub
Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits
☆23Dec 15, 2022Updated 3 years ago
HicServices / SynthEHR
View on GitHub
Library and CLI for randomly generating medical data like you might get out of an Electronic Health Records (EHR) system
☆37Nov 21, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jacobvsdanniel / pubmedkb_core
View on GitHub
☆15Aug 24, 2022Updated 3 years ago
gmpoli / electramed
View on GitHub
☆13Oct 20, 2022Updated 3 years ago
JHart96 / keras_gcn_sequence_labelling
View on GitHub
Keras implementation of graph convolutional networks for sequence labelling
☆12Sep 21, 2018Updated 7 years ago
suamin / MedDistant19
View on GitHub
MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)
☆19Oct 13, 2022Updated 3 years ago
disanda / RFM
View on GitHub
A fast method for real face morphing (一个可以快速部署实现的人脸变形方法)
☆11May 31, 2022Updated 4 years ago
aground5 / livid-community
View on GitHub
☆17Mar 21, 2026Updated 4 months ago
lasigeBioTM / IHP
View on GitHub
Identification of Human Phenotype Entities
☆11Nov 2, 2018Updated 7 years ago
digitalprk / KoreaNER
View on GitHub
Bi-LSTM - CRF Named Entity Recognition model for Korean (Keras)
☆16Feb 7, 2018Updated 8 years ago
Zerohertz / Instruct_KR_2025_Summer_Meetup_vLLM
View on GitHub
🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹
☆23Aug 2, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yanghanxy / CIAN
View on GitHub
Implementation of the Character-level Intra Attention Network (CIAN) for Natural Language Inference (NLI) upon SNLI and MultiNLI corpus
☆17Nov 24, 2017Updated 8 years ago
Maize-Genetics-and-Genomics-Database / PanEffect
View on GitHub
PanEffect is a JavaScript framework to explore variant effects across a pangenome. The tool has two views that allows a user to (1) expl…
☆13Jan 30, 2024Updated 2 years ago
lasigeBioTM / PGR
View on GitHub
A Silver Standard Corpus of Human Phenotype-Gene Relations
☆13Oct 6, 2022Updated 3 years ago
MAGIC-AI4Med / MedRBench
View on GitHub
[Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".
☆70Nov 7, 2025Updated 8 months ago
prajwaltr93 / teaching_robots_to_draw
View on GitHub
an attempt at implementing deep learning model proposed in paper teaching robots to draw
☆11Aug 13, 2021Updated 4 years ago
JanaSperschneider / NuclearPhaser
View on GitHub
Phasing of dikaryotic fungal genome assemblies
☆13Mar 1, 2023Updated 3 years ago
whybe-choi / kovidore-benchmark
View on GitHub
[ACL'26 Workshop] KoViDoRe: Korean Visual Document Retrieval Benchmark
☆24Jul 2, 2026Updated 3 weeks ago
ekg / ACAD18
View on GitHub
Day 2 of ACAD's 2018 Advanced Bioinformatics Workshop
☆12Nov 27, 2018Updated 7 years ago
liseda-lab / kgsim-benchmark
View on GitHub
☆13Mar 15, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
apeltzer / DeDup
View on GitHub
A merged read deduplication tool capable to perform merged read deduplication on single end data.
☆14Sep 4, 2024Updated last year
evanbiederstedt / edlibR
View on GitHub
R integration for edlib, a C/C++ library for pairwise sequence alignment using edit distance (Levenshtein distance).
☆11Jul 20, 2025Updated last year
desh2608 / attention_model_clinical_text
View on GitHub
This code implements attention network on top of the CNN used in the paper titled "Relation extraction from clinical texts using domain i…
☆13Oct 8, 2016Updated 9 years ago
gdbinit / delambert
View on GitHub
GreenLambert macOS IDA plugin to deobfuscate strings
☆14Oct 4, 2021Updated 4 years ago
mcfrith / local-rearrangements
View on GitHub
☆13Nov 15, 2017Updated 8 years ago
bayer-science-for-a-better-life / contrastive-reconstruction
View on GitHub
Tensorflow-keras implementation for Contrastive Reconstruction (ConRec) : a self-supervised learning algorithm that obtains image represe…
☆13Feb 22, 2022Updated 4 years ago
Amjad-Khalaf / Inverted-kmers
View on GitHub
Identifying large scale inversions between two genomes by mapping genome 1's unique kmers onto genome 2.
☆10Jun 6, 2025Updated last year