GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.
☆85Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for GMAI-VL
Users that are interested in GMAI-VL are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆39Jun 4, 2025Updated 9 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models☆424Apr 13, 2025Updated 10 months ago
- ☆21Nov 27, 2025Updated 3 months ago
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆120Jan 9, 2025Updated last year
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…☆401Jul 11, 2025Updated 7 months ago
- ☆59Jun 18, 2024Updated last year
- [NeurIPS'25 | CVPR'26] The official repo of OralGPT & MMOral Bench.☆73Updated this week
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement☆35Oct 18, 2025Updated 4 months ago
- MSHub: Medical Image Segmentation Hub with Pre-trained nnUNets☆23Feb 28, 2025Updated last year
- ☆25Updated this week
- ☆25Jan 11, 2025Updated last year
- [Sci. Rep. 2025] Revisiting model scaling with a U-net benchmark for 3D medical image segmentation☆18Aug 21, 2025Updated 6 months ago
- [IEEE TMI] Tumor synthesis leveraging medical reports.☆48Jan 26, 2026Updated last month
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆42Jun 29, 2025Updated 8 months ago
- A Python tool to evaluate the performance of VLM on the medical domain.☆83Aug 5, 2025Updated 7 months ago
- The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"☆28Jan 22, 2025Updated last year
- One-line code to get SoTA pre-trained Medical Image Models ready in PyTorch.☆49Mar 29, 2025Updated 11 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Oct 23, 2025Updated 4 months ago
- The official repository to build SAT-DS, a medical data collection of over 72 public segmentation datasets, contains over 22K 3D images, …☆141Dec 3, 2025Updated 3 months ago
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆19Oct 19, 2023Updated 2 years ago
- CT-FM: A 3D Image-Based Foundation Model for Computed Tomography☆63Feb 13, 2025Updated last year
- [TPAMI 2025] Large-Scale 3D Medical Image Pre-training with Geometric Context Priors☆243Jan 13, 2026Updated last month
- ☆25May 12, 2025Updated 9 months ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Nov 5, 2024Updated last year
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆82Dec 17, 2024Updated last year
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆77Dec 4, 2024Updated last year
- MedEvalKit: A Unified Medical Evaluation Framework☆211Feb 24, 2026Updated 2 weeks ago
- [npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"☆283Dec 29, 2025Updated 2 months ago
- [NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures☆131Nov 24, 2025Updated 3 months ago
- SAM-Med2D: Bridging the Gap between Natural Image Segmentation and Medical Image Segmentation☆76Nov 19, 2023Updated 2 years ago
- [ICCV 2025] Medical World Model☆117Jul 31, 2025Updated 7 months ago
- Repository for the Universal Lesion Segmentation Challenge '23☆40May 11, 2025Updated 9 months ago
- A generalist foundation model for healthcare capable of handling diverse medical data modalities.☆92Apr 25, 2024Updated last year
- A metric suite leveraging the logical inference capabilities of LLMs, for radiology report generation both with and without grounding☆92Jan 16, 2026Updated last month
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆45Oct 18, 2025Updated 4 months ago
- ☆48Feb 26, 2025Updated last year
- This repository provides a 3D implementation of DINOv2 for self-supervised pretraining on volumetric (3D) medical images using Lightly, M…☆50Feb 18, 2026Updated 2 weeks ago
- [MICCAI 2023] Continual Learning for Abdominal Multi-Organ and Tumor Segmentation☆78Jul 30, 2024Updated last year
- The official code to build up dataset PMC-OA☆34Jul 16, 2024Updated last year