yezanting/Med-VLM-Bench-Summary

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yezanting/Med-VLM-Bench-Summary)

yezanting / Med-VLM-Bench-Summary

A Curated Benchmark Repository for Medical Vision-Language Models

☆198

Alternatives and similar repositories for Med-VLM-Bench-Summary

Users that are interested in Med-VLM-Bench-Summary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YichiZhang98 / PET2Rep
View on GitHub
[AAAI'26] PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography
☆24Dec 26, 2025Updated 7 months ago
CUHK-AIM-Group / MedSAM-Agent
View on GitHub
MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning
☆89Feb 5, 2026Updated 5 months ago
Awenbocc / GEMeX-Project
View on GitHub
Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]
☆48Jun 29, 2025Updated last year
Schuture / DeepTumorVQA
View on GitHub
DeepTumorVQA benchmark for VLMs and Agents (10k testing samples)
☆40May 19, 2026Updated 2 months ago
mk-runner / MLRG
View on GitHub
[CVPR'25] Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
☆104Jun 2, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CUHK-AIM-Group / AMID
View on GitHub
AMID: Towards Autonomous and Auditable Medical Imaging Model Development
☆20Jul 14, 2026Updated 2 weeks ago
lab-rasool / Awesome-Medical-VLMs-and-Datasets
View on GitHub
A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets
☆230Mar 19, 2025Updated last year
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 8 months ago
zzma2 / medical-llm-reasoning-survey
View on GitHub
A curated list of medical reasoning research on large language models, organized by modality, technique, application, and benchmark.
☆19Oct 17, 2025Updated 9 months ago
microsoft / LLaVA-Rad
View on GitHub
Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.
☆58Jan 22, 2026Updated 6 months ago
ShawnHuang497 / MedPLIB
View on GitHub
The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'
☆134Jul 7, 2026Updated 3 weeks ago
UCSB-AI / ProbMed
View on GitHub
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25May 12, 2026Updated 2 months ago
MrGiovanni / RadGPT
View on GitHub
[ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …
☆215Dec 31, 2025Updated 6 months ago
Yuxiang-Lai117 / Med-R1
View on GitHub
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
☆129Jul 7, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xmed-lab / MedRegA
View on GitHub
[ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
☆46Oct 18, 2025Updated 9 months ago
mk-runner / Awesome-Radiology-Report-Generation
View on GitHub
paper list, dataset, and tools for radiology report generation
☆467Updated this week
alibaba-damo-academy / MedEvalKit
View on GitHub
MedEvalKit: A Unified Medical Evaluation Framework
☆247Feb 24, 2026Updated 5 months ago
uni-medical / Project-Imaging-X
View on GitHub
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
☆468Apr 3, 2026Updated 3 months ago
zhi-xuan-chen / Reg2RG
View on GitHub
This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…
☆73Jun 28, 2025Updated last year
Tang-xiaoxiao / 3D-RAD
View on GitHub
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆34Jun 22, 2026Updated last month
shaohao011 / MedCCO
View on GitHub
[ACM MM2026] This is the official implementation of MedCCO
☆17Jul 12, 2026Updated 2 weeks ago
richard-peng-xia / awesome-multimodal-in-medical-imaging
View on GitHub
A collection of resources on applications of multi-modal learning in medical imaging.
☆973Jul 21, 2026Updated last week
Cocofeat / uMedGround
View on GitHub
【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
☆35Jul 9, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
eth-medical-ai-lab / rad-agent
View on GitHub
RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography
☆42Jun 3, 2026Updated last month
kevinwu23 / Stanford-MedCaseReasoning
View on GitHub
☆51Jun 2, 2025Updated last year
Luffy03 / GF-Screen
View on GitHub
[ICLR 2026] Glance and Focus Reinforcement for Pan-cancer Screening
☆36May 14, 2026Updated 2 months ago
zenghy96 / Reliable-Source-Approximation
View on GitHub
Reliable Source Approximation: Source-Free Domain Adaptation for Vestibular Schwannoma MRI Segmentation
☆11Dec 28, 2024Updated last year
Masaaki-75 / progemu
View on GitHub
Official repository for the paper "Towards Interpretable Counterfactual Generation via Multimodal Autoregression"
☆17Nov 7, 2025Updated 8 months ago
Xu-Huihui / MedGround-R1
View on GitHub
Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…
☆41Sep 28, 2025Updated 10 months ago
CUHK-AIM-Group / OmniBrainBench
View on GitHub
[CVPR 2026] OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks
☆17Mar 29, 2026Updated 4 months ago
AIoT-Lab-AI4LIFE / ViPET-ReportGen
View on GitHub
[NeurIPS 2025] Toward a Vision-Language Foundation Model for Medical Data: Multimodal Dataset and Benchmarks for Vietnamese PET/CT Report…
☆15Jul 22, 2026Updated last week
ZrH42 / UniX
View on GitHub
☆31Mar 29, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ibrahimethemhamamci / CT-CLIP
View on GitHub
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography
☆410Jul 18, 2025Updated last year
function2-llx / MMMM
View on GitHub
[NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
☆31Mar 10, 2025Updated last year
thibault-wch / Brain-GBM-world-model
View on GitHub
This is a code implementation of the Brain-WM proposed in the manuscript "Brain-WM: Brain Glioblastoma World Model".
☆23Jun 11, 2026Updated last month
LijunRio / AG-KD
View on GitHub
This repository contains the code for our paper: Enhancing Abnormality Grounding for Vision-Language Models with Knowledge Descriptions
☆19Jun 24, 2025Updated last year
UbiquantAI / Fleming-VL
View on GitHub
Fleming-VL: Towards Universal Medical Visual Understanding with Multimodal LLMs
☆15Nov 6, 2025Updated 8 months ago
Project-MONAI / VLM-Radiology-Agent-Framework
View on GitHub
☆220Sep 22, 2025Updated 10 months ago
BAAI-DCAI / M3D
View on GitHub
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
☆454Apr 13, 2025Updated last year