[Nature 2026] Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.
☆321Mar 15, 2026Updated this week
Alternatives and similar repositories for Merlin
Users that are interested in Merlin are comparing it to the libraries listed below
Sorting:
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆118Jan 16, 2026Updated 2 months ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆369Jul 18, 2025Updated 8 months ago
- ☆55Dec 11, 2024Updated last year
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆97Oct 15, 2024Updated last year
- The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".☆528Jul 25, 2025Updated 7 months ago
- [MIDL 2025] Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders☆176Feb 20, 2026Updated last month
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆202Dec 31, 2025Updated 2 months ago
- Foundation 3D ViT model for volumetric head CT☆49Sep 29, 2025Updated 5 months ago
- [ICCV 2025] Dataset of 10,135 abdominal CT scans with 15,130 tumors annotated across six organs and 5,893 controls. The AI ranks first in…☆52Nov 3, 2025Updated 4 months ago
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆55Jan 22, 2026Updated last month
- [ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)☆402Jan 13, 2026Updated 2 months ago
- ☆37Jan 26, 2026Updated last month
- CT-FM: A 3D Image-Based Foundation Model for Computed Tomography☆66Feb 13, 2025Updated last year
- [NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)☆299Nov 24, 2025Updated 3 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models☆424Apr 13, 2025Updated 11 months ago
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- ☆46Mar 9, 2026Updated last week
- This repository provides a 3D implementation of DINOv2 for self-supervised pretraining on volumetric (3D) medical images using Lightly, M…☆50Mar 12, 2026Updated last week
- ☆194Feb 21, 2025Updated last year
- A curated list of foundation models for vision and language tasks in medical imaging☆300Jun 3, 2024Updated last year
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging☆118Jul 1, 2024Updated last year
- Tool for robust segmentation of >100 important anatomical structures in CT and MR images☆2,526Mar 13, 2026Updated last week
- [Nature Machine Intelligence 2024] Code and evaluation repository for the paper☆132Mar 5, 2025Updated last year
- paper list, dataset, and tools for radiology report generation☆378Mar 13, 2026Updated last week
- 🏆1st place in the PANORAMA challenge (early detection of PDAC on contrast-enhanced CT)☆15Jan 13, 2026Updated 2 months ago
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement☆36Oct 18, 2025Updated 5 months ago
- [NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures☆131Nov 24, 2025Updated 3 months ago
- [ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.☆669Oct 24, 2025Updated 4 months ago
- Computed tomography to body composition (Comp2Comp).☆110Jan 6, 2026Updated 2 months ago
- ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field☆187Oct 9, 2025Updated 5 months ago
- [MICCAI 2025 Best Paper Award] Learning Segmentation from Radiology Reports☆110Mar 9, 2026Updated last week
- ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes☆188Jul 3, 2024Updated last year
- [EMNLP, Findings 2024] a radiology report generation metric that leverages the natural language understanding of language models to ident…☆71Sep 9, 2025Updated 6 months ago
- [CVPR 2024] VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis☆219Dec 1, 2025Updated 3 months ago
- MONAI Versatile Imaging Segmentation and Annotation☆267Jan 18, 2026Updated 2 months ago
- [ICCV 2025] MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs☆32Jan 26, 2026Updated last month
- ☆117Sep 4, 2025Updated 6 months ago
- [IEEE TMI] Tumor synthesis leveraging medical reports.☆48Jan 26, 2026Updated last month
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆32Nov 4, 2025Updated 4 months ago