yangzhou12/awesome-medical-vision-language-models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yangzhou12/awesome-medical-vision-language-models)

yangzhou12 / awesome-medical-vision-language-models

A collection of resources on Medical Vision-Language Models

☆110

Alternatives and similar repositories for awesome-medical-vision-language-models

Users that are interested in awesome-medical-vision-language-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

richard-peng-xia / awesome-multimodal-in-medical-imaging
View on GitHub
A collection of resources on applications of multi-modal learning in medical imaging.
☆973Jul 21, 2026Updated last week
hwei-hw / Generalist_Vision_Foundation_Models_for_Medical_Imaging
View on GitHub
The repo of the paper: Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medic…
☆11May 26, 2023Updated 3 years ago
lab-rasool / Awesome-Medical-VLMs-and-Datasets
View on GitHub
A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets
☆230Mar 19, 2025Updated last year
cchen-cc / CMITM
View on GitHub
☆20Nov 4, 2023Updated 2 years ago
razorx89 / roco-dataset
View on GitHub
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
☆250Apr 5, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jbdel / vilmedic
View on GitHub
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
☆189Oct 9, 2025Updated 9 months ago
eth-medical-ai-lab / smmile
View on GitHub
[NeurIPS Datasets & Benchmarks 2025] SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning
☆15Dec 2, 2025Updated 7 months ago
ayanglab / SwinGANMR
View on GitHub
Official implementation of SwinGANMR
☆17Sep 5, 2022Updated 3 years ago
ChantalMP / Rad-ReStruct
View on GitHub
Official repository for the paper "Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting" (MICCAI23)
☆33Jan 4, 2024Updated 2 years ago
naamiinepal / medvlsm
View on GitHub
[MIDL 2024] Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models
☆71Nov 28, 2024Updated last year
zhjohnchan / PTUnifier
View on GitHub
[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
☆78Mar 22, 2024Updated 2 years ago
zhjohnchan / M3AE
View on GitHub
[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
☆134Sep 16, 2022Updated 3 years ago
xmindflow / Awesome-Foundation-Models-in-Medical-Imaging
View on GitHub
A curated list of foundation models for vision and language tasks in medical imaging
☆301Jun 3, 2024Updated 2 years ago
zou-group / OpenBiomedVid
View on GitHub
☆44Apr 20, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Kaushalya / medclip
View on GitHub
A multi-modal CLIP model trained on the medical dataset ROCO
☆151Jun 4, 2025Updated last year
YijinHuang / FPT
View on GitHub
[TNNLS'25] [MICCAI'24] A Parameter and Memory Efficient Transfer Learning Method
☆35Oct 29, 2025Updated 9 months ago
MediaBrain-SJTU / K-Diag
View on GitHub
☆10Aug 20, 2023Updated 2 years ago
chaoyi-wu / RadFM
View on GitHub
The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".
☆562Jul 25, 2025Updated last year
Aofei-Chang / MedHEval
View on GitHub
Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"
☆16Apr 23, 2025Updated last year
estanley16 / SimBA
View on GitHub
Implementation for Simulated Bias in Artificial Medical Images (SimBA) framework 🦁
☆11Apr 1, 2025Updated last year
baeseongsu / mimic-cxr-vqa
View on GitHub
A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…
☆100Feb 6, 2026Updated 5 months ago
mlii0117 / DCL
View on GitHub
Official code for "Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation" (CVPR 2023)
☆120May 7, 2023Updated 3 years ago
CAMMA-public / attention-tripnet
View on GitHub
☆11Sep 17, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
burglarhobbit / Awesome-Medical-Large-Language-Models
View on GitHub
Curated papers on Large Language Models in Healthcare and Medical domain
☆393May 29, 2025Updated last year
JoshuaChou2018 / MedAGI
View on GitHub
Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost
☆39Jun 21, 2023Updated 3 years ago
snap-stanford / med-flamingo
View on GitHub
☆452Aug 23, 2023Updated 2 years ago
abhisheksambyal / Self-supervised-learning-by-context-prediction
View on GitHub
Implementation of "Unsupervised Visual Representation Learning by Context Prediction" by C. Doersh, A. Gupta and A. A. Efros
☆24Nov 18, 2021Updated 4 years ago
MoMarky / radiology-report-extraction
View on GitHub
Extract the findings and impression section of the radiology reports in the MIMIC-CXR-Report and OpenI datasets.
☆24May 25, 2023Updated 3 years ago
Holipori / Medical-CXR-VQA
View on GitHub
☆46Jan 21, 2025Updated last year
chunmeifeng / MARIO
View on GitHub
☆12Feb 19, 2022Updated 4 years ago
zhaozh10 / Awesome-CLIP-in-Medical-Imaging
View on GitHub
A Survey on CLIP in Medical Imaging
☆515Mar 26, 2025Updated last year
MediaBrain-SJTU / MedKLIP
View on GitHub
The official code for MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology. We propose to leverage medical specif…
☆181Sep 4, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
muxin-wei / Rep-MedSAM
View on GitHub
Top 3 solution for CVPR24 SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP Challenge
☆11Apr 8, 2025Updated last year
StanfordMIMI / RaLEs
View on GitHub
Radiology Language Evaluations
☆11Nov 17, 2023Updated 2 years ago
mobarakol / SVLS
View on GitHub
☆15Jul 4, 2023Updated 3 years ago
alexanderjaus / AtlasDataset
View on GitHub
☆47Dec 22, 2024Updated last year
trinhvg / ViDRiP-LLaVA
View on GitHub
ViDRiP-LLaVA: A Dataset and Benchmark for Diagnostic Reasoning from Pathology Videos
☆25May 21, 2025Updated last year
openmedlab / Awesome-Medical-Dataset
View on GitHub
Collection of awesome medical dataset resources.
☆2,058Jan 23, 2025Updated last year
DDI-Dataset / DDI-Code
View on GitHub
☆24Feb 27, 2023Updated 3 years ago