Aofei-Chang/MedHEval

Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"

☆16

Alternatives and similar repositories for MedHEval

Users who are interested in MedHEval are comparing it to the repositories listed below.

ydk122024 / Med-HallMark
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models
☆14 · Jun 24, 2024 · Updated 2 years ago
AQ-MedAI / LiveClin
LiveClin is a live benchmark designed for the faithful replication of clinical practice
☆16 · Feb 27, 2026 · Updated 5 months ago
medhalt / medhalt
☆34 · Mar 7, 2026 · Updated 4 months ago
Jack-ZC8 / M3AV-dataset
[ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
☆24 · May 29, 2025 · Updated last year
mshehrozsajjad / Age-Classification
Age and gender classification is a dual-task of identifying the age and gender of a person from an image or video.
☆12 · Apr 16, 2019 · Updated 7 years ago
BDBC-KG-NLP / Recurrent_Interaction_Network_EMNLP2020
Here is the code for the paper "Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations" accepted by EM…
☆13 · Nov 17, 2021 · Updated 4 years ago
pixas / TAIA_LLM
☆17 · Nov 1, 2024 · Updated last year
lxirich / OphthaReason
☆22 · Sep 4, 2025 · Updated 10 months ago
ZhiluZhang123 / neurips_2020_distillation
Code for "Self-Distillation as Instance-Specific Label Smoothing"
☆15 · Oct 22, 2020 · Updated 5 years ago
promptslab / RosettaEval
LLMEval
☆11 · Feb 12, 2024 · Updated 2 years ago
AICAN-Research / learn-pathology
A web-based system for learning pathology
☆14 · Updated this week
natalies-teaching / Comp790-166-CompBio-Spring2022
Course page for comp790-166 computational biology in spring 2022
☆11 · Apr 25, 2022 · Updated 4 years ago
Davidczy / Uni4Eye_pp
☆16 · Oct 31, 2024 · Updated last year
XixiLiu95 / GEN
Official repository for CVPR2023 publication, GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection
☆19 · Sep 25, 2024 · Updated last year
Hao-Ning / MEIDTM-Instance-Dependent-Label-Noise-Learning-with-Manifold-Regularized-Transition-Matrix-Estimatio
pytorch
☆10 · Apr 13, 2022 · Updated 4 years ago
luka-group / CoIN
☆14 · Jun 11, 2024 · Updated 2 years ago
BlueZeros / ReflecTool
Benchmark, Toolbox, and Reflection-based Method for Clinical Agent
☆22 · Nov 6, 2024 · Updated last year
Merrical / PADL
Code for Modeling Annotator Preference and Stochastic Annotation Error for Medical Image Segmentation (MedIA 2023).
☆10 · Nov 17, 2023 · Updated 2 years ago
KyleKWKim / LLM-guided-Multimodal-MIL
[2024][MICCAI] LLM-guided Multi-modal Multiple Instance Learning for 5-year Overall Survival Prediction of Lung Cancer
☆18 · Mar 30, 2026 · Updated 4 months ago
openseadragon / html-overlay
An OpenSeadragon plugin that adds HTML overlay capability.
☆14 · Oct 28, 2022 · Updated 3 years ago
martinagvilas / vit-cls_emb
Accompanying code for "Analyzing Vision Transformers in Class Embedding Space" (NeurIPS '23)
☆16 · Jun 10, 2024 · Updated 2 years ago
GregxmHu / OccuBench
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
☆21 · Apr 14, 2026 · Updated 3 months ago
key1589745 / decouple_predict
☆14 · Nov 29, 2022 · Updated 3 years ago
yukiti2007 / sample
☆11 · Jun 27, 2023 · Updated 3 years ago
Vekteur / probabilistic-calibration-study
Implementation of "A Large-Scale Study of Probabilistic Calibration in Neural Network Regression" (ICML 2023)
☆11 · Oct 7, 2025 · Updated 9 months ago
sangminwoo / RITUAL
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆14 · Dec 16, 2024 · Updated last year
ForJadeForest / LIVE-Learnable-In-Context-Vector
【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185
☆23 · May 31, 2025 · Updated last year
deeplearning-wisc / understanding_lp
Official implementation of the paper "Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding"
☆18 · Sep 30, 2025 · Updated 10 months ago
yycunc / SAMEclustering
SAME (Single-cell RNA-seq Aggregated clustering via Mixture model Ensemble): Cluster ensemble for single-cell RNA-seq data
☆16 · May 4, 2021 · Updated 5 years ago
paulgavrikov / vlm_shapebias
Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).
☆30 · Jan 26, 2025 · Updated last year
andreped / DMDetect
Code relevant for training, evaluating, assessing, and deploying CNNs for image classification and segmentation of Digital Mammography im…
☆10 · Mar 31, 2023 · Updated 3 years ago
forwchen / LLaVA-MoLE
☆10 · Mar 4, 2024 · Updated 2 years ago
xuefeng-li1 / Provably-end-to-end-label-noise-learning-without-anchor-points
☆15 · Jun 9, 2021 · Updated 5 years ago
jinlHe / PeFoMed
The code for paper: PeFoMed: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering
☆64 · Dec 21, 2025 · Updated 7 months ago
PabloMessina / MedVQA
☆13 · Jan 3, 2026 · Updated 6 months ago
annotorious / annotorious-v2-selector-pack
Additional selection tools for Annotorious and the Annotorious OpenSeadragon plugin
☆15 · Jul 21, 2024 · Updated 2 years ago
lisa-wm / entropybaseduq
☆12 · Apr 4, 2025 · Updated last year
FreedomIntelligence / HuatuoGPT-Vision
Medical Multimodal LLMs
☆398 · Apr 23, 2025 · Updated last year
hooman007 / ProtoASNet
Official repository for the paper "ProtoASNet: Dynamic Prototypes for Inherently Interpretable and Uncertainty-Aware Aortic Stenosis Clas…
☆13 · Oct 21, 2023 · Updated 2 years ago