lab-rasool / Awesome-Medical-VLMs-and-Datasets
A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets
☆212 · Mar 19, 2025 · Updated 10 months ago
Alternatives and similar repositories for Awesome-Medical-VLMs-and-Datasets
Users who are interested in Awesome-Medical-VLMs-and-Datasets are comparing it to the repositories listed below
- LLaVA Version of RaDialog ☆26 · May 27, 2025 · Updated 8 months ago
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning ☆48 · Dec 21, 2025 · Updated last month
- A collection of resources on Medical Vision-Language Models ☆106 · Dec 23, 2023 · Updated 2 years ago
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs) ☆30 · Jul 8, 2025 · Updated 7 months ago
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. ☆81 · Dec 17, 2024 · Updated last year
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance" ☆113 · Jun 4, 2025 · Updated 8 months ago
- paper list, dataset, and tools for radiology report generation ☆360 · Updated this week
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models" ☆154 · Jul 7, 2025 · Updated 7 months ago
- A collection of resources on applications of multi-modal learning in medical imaging. ☆913 · Feb 8, 2026 · Updated last week
- A framework for Longitudinal Radiology Report Generation ☆27 · Aug 10, 2024 · Updated last year
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models ☆104 · Jul 7, 2025 · Updated 7 months ago
- This is the official code of "MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation" (AAAI 2025 oral) ☆30 · Dec 5, 2025 · Updated 2 months ago
- Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities. ☆2,134 · Jun 4, 2025 · Updated 8 months ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025] ☆42 · Jun 29, 2025 · Updated 7 months ago
- A Python tool to evaluate the performance of VLM on the medical domain. ☆83 · Aug 5, 2025 · Updated 6 months ago
- The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data". ☆524 · Jul 25, 2025 · Updated 6 months ago
- Learning to Use Medical Tools with Multi-modal Agent ☆228 · Feb 7, 2026 · Updated last week
- ☆53 · Jun 2, 2024 · Updated last year
- Radiology Objects in COntext (ROCO): A Multimodal Image Dataset ☆239 · Apr 5, 2022 · Updated 3 years ago
- ☆202 · Sep 22, 2025 · Updated 4 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models ☆421 · Apr 13, 2025 · Updated 10 months ago
- A Curated Benchmark Repository for Medical Vision-Language Models ☆179 · Jan 21, 2026 · Updated 3 weeks ago
- [npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts" ☆280 · Dec 29, 2025 · Updated last month
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks ☆45 · Oct 18, 2025 · Updated 3 months ago
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging ☆118 · Jul 1, 2024 · Updated last year
- [CVPR'25] Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation ☆84 · Feb 8, 2026 · Updated last week
- A multi-modal CLIP model trained on the medical dataset ROCO ☆149 · Jun 4, 2025 · Updated 8 months ago
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations… ☆399 · Jul 11, 2025 · Updated 7 months ago
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025) ☆117 · Jan 16, 2026 · Updated last month
- Collection of awesome medical dataset resources. ☆1,637 · Jan 23, 2025 · Updated last year
- The official GitHub repository of the survey paper "A Systematic Review of Deep Learning-based Research on Radiology Report Generation". ☆96 · May 17, 2025 · Updated 8 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024. ☆24 · Jan 6, 2025 · Updated last year
- ☆103 · May 26, 2025 · Updated 8 months ago
- Code implementation of ProtoSAM - One Shot Medical Image Segmentation with Foundational Models ☆44 · Oct 10, 2024 · Updated last year
- Foundation models based medical image analysis ☆208 · Jan 6, 2026 · Updated last month
- [MICCAI'24] Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation ☆24 · Apr 14, 2025 · Updated 10 months ago
- Official code of Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training [ICML 2024] ☆24 · May 31, 2024 · Updated last year
- One-Prompt to Segment All Medical Images [CVPR 2024] ☆140 · Jun 14, 2024 · Updated last year
- Pyramid Attention Network for Medical Image Registration (ISBI 2024) ☆16 · Feb 6, 2025 · Updated last year