Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.
☆190Feb 20, 2026Updated last week
Alternatives and similar repositories for Merlin
Users that are interested in Merlin are comparing it to the libraries listed below
Sorting:
- Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)☆117Jan 16, 2026Updated last month
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆353Jul 18, 2025Updated 7 months ago
- ☆55Dec 11, 2024Updated last year
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆91Oct 15, 2024Updated last year
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆54Jan 22, 2026Updated last month
- Foundation 3D ViT model for volumetric head CT☆49Sep 29, 2025Updated 5 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆195Dec 31, 2025Updated 2 months ago
- [ICCV 2025] Dataset of 10,135 abdominal CT scans with 15,130 tumors annotated across six organs and 5,893 controls. The AI ranks first in…☆51Nov 3, 2025Updated 3 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models☆423Apr 13, 2025Updated 10 months ago
- [MIDL 2025] Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders☆175Feb 20, 2026Updated last week
- The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data".☆524Jul 25, 2025Updated 7 months ago
- [ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)☆398Jan 13, 2026Updated last month
- ☆37Jan 26, 2026Updated last month
- CT-FM: A 3D Image-Based Foundation Model for Computed Tomography☆63Feb 13, 2025Updated last year
- [NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)☆298Nov 24, 2025Updated 3 months ago
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement☆35Oct 18, 2025Updated 4 months ago
- This repository provides a 3D implementation of DINOv2 for self-supervised pretraining on volumetric (3D) medical images using Lightly, M…☆48Feb 18, 2026Updated last week
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- ☆194Feb 21, 2025Updated last year
- [NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures☆132Nov 24, 2025Updated 3 months ago
- ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field☆187Oct 9, 2025Updated 4 months ago
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging☆119Jul 1, 2024Updated last year
- ☆40Feb 20, 2026Updated last week
- Tool for robust segmentation of >100 important anatomical structures in CT and MR images☆2,483Feb 11, 2026Updated 2 weeks ago
- Towards Scalable Language-Image Pre-training for 3D Medical Imaging [TMLR 2026]☆33Updated this week
- [ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.☆665Oct 24, 2025Updated 4 months ago
- ☆26Aug 16, 2023Updated 2 years ago
- paper list, dataset, and tools for radiology report generation☆369Feb 20, 2026Updated last week
- [Nature Machine Intelligence 2024] Code and evaluation repository for the paper☆131Mar 5, 2025Updated 11 months ago
- Torch utilities for 3D medical imaging☆53Jan 16, 2026Updated last month
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆31Nov 4, 2025Updated 3 months ago
- [EMNLP, Findings 2024] a radiology report generation metric that leverages the natural language understanding of language models to ident…☆70Sep 9, 2025Updated 5 months ago
- [MICCAI 2025 Best Paper Award] Learning Segmentation from Radiology Reports☆108Updated this week
- [ICCV 2025] MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs☆31Jan 26, 2026Updated last month
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach☆19Nov 17, 2025Updated 3 months ago
- Computed tomography to body composition (Comp2Comp).☆109Jan 6, 2026Updated last month
- ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes☆187Jul 3, 2024Updated last year
- ☆113Sep 4, 2025Updated 5 months ago
- A curated list of foundation models for vision and language tasks in medical imaging☆300Jun 3, 2024Updated last year