BAAI-DCAI / M3DView external linksLinks
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
☆421Apr 13, 2025Updated 10 months ago
Alternatives and similar repositories for M3D
Users that are interested in M3D are comparing it to the libraries listed below
Sorting:
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆347Jul 18, 2025Updated 6 months ago
- The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".☆524Jul 25, 2025Updated 6 months ago
- The official code for "SegVol: Universal and Interactive Volumetric Medical Image Segmentation".☆366Jan 11, 2026Updated last month
- [npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"☆280Dec 29, 2025Updated last month
- [ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.☆663Oct 24, 2025Updated 3 months ago
- [ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)☆396Jan 13, 2026Updated last month
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging☆118Jul 1, 2024Updated last year
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆90Oct 15, 2024Updated last year
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆117Jan 16, 2026Updated last month
- [NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)☆298Nov 24, 2025Updated 2 months ago
- A collection of resources on applications of multi-modal learning in medical imaging.☆913Feb 8, 2026Updated last week
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"☆46Apr 24, 2025Updated 9 months ago
- EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts☆664Apr 12, 2024Updated last year
- The official code for MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology. We propose to leverage medical specif…☆178Sep 4, 2023Updated 2 years ago
- A Survey on CLIP in Medical Imaging☆501Mar 26, 2025Updated 10 months ago
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆85Jun 4, 2025Updated 8 months ago
- [TPAMI 2025] Large-Scale 3D Medical Image Pre-training with Geometric Context Priors☆238Jan 13, 2026Updated last month
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆18Nov 25, 2024Updated last year
- [NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures☆131Nov 24, 2025Updated 2 months ago
- SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Image☆860Sep 21, 2025Updated 4 months ago
- ☆112Sep 4, 2025Updated 5 months ago
- [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…☆399Jul 11, 2025Updated 7 months ago
- [CVPR 2023] Label-Free Liver Tumor Segmentation☆374Jan 26, 2026Updated 2 weeks ago
- Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology repo…☆191Oct 22, 2025Updated 3 months ago
- ☆59Jun 18, 2024Updated last year
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".☆49Jan 6, 2026Updated last month
- [CVPR 2024] VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis☆216Dec 1, 2025Updated 2 months ago
- A Python tool to evaluate the performance of VLM on the medical domain.☆83Aug 5, 2025Updated 6 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆194Dec 31, 2025Updated last month
- ☆25Jan 11, 2025Updated last year
- This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…☆66Jun 28, 2025Updated 7 months ago
- BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks☆702Jul 8, 2025Updated 7 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 3 months ago
- ☆32Mar 25, 2025Updated 10 months ago
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆31Nov 4, 2025Updated 3 months ago
- paper list, dataset, and tools for radiology report generation☆360Updated this week
- The official repository to build SAT-DS, a medical data collection of over 72 public segmentation datasets, contains over 22K 3D images, …☆138Dec 3, 2025Updated 2 months ago
- ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes☆186Jul 3, 2024Updated last year
- ☆202Sep 22, 2025Updated 4 months ago