alibaba-damo-academy / fvlm
Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)
☆117 Jan 16, 2026 Updated last month
Alternatives and similar repositories for fvlm
Users that are interested in fvlm are comparing it to the libraries listed below
- Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology repo… ☆190 Oct 22, 2025 Updated 3 months ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography ☆347 Jul 18, 2025 Updated 6 months ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography ☆90 Oct 15, 2024 Updated last year
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation" ☆46 Apr 24, 2025 Updated 9 months ago
- Towards Scalable Language-Image Pre-training for 3D Medical Imaging ☆33 Jan 30, 2026 Updated 2 weeks ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models ☆422 Apr 13, 2025 Updated 10 months ago
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach ☆19 Nov 17, 2025 Updated 3 months ago
- paper list, dataset, and tools for radiology report generation ☆360 Feb 10, 2026 Updated last week
- ☆25 Jan 11, 2025 Updated last year
- CT-FM: A 3D Image-Based Foundation Model for Computed Tomography ☆63 Feb 13, 2025 Updated last year
- [Nature Communications 2025] Large-Scale Generative Tumor Synthesis in Computed Tomography Images for Improving Tumor Recognition ☆53 Dec 16, 2025 Updated 2 months ago
- ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes ☆186 Jul 3, 2024 Updated last year
- [ICLR'25 Spotlight] LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models ☆140 Aug 27, 2025 Updated 5 months ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning ☆37 Apr 21, 2025 Updated 9 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI". ☆49 Jan 6, 2026 Updated last month
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging ☆118 Jul 1, 2024 Updated last year
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, … ☆194 Dec 31, 2025 Updated last month
- [TPAMI 2025] Large-Scale 3D Medical Image Pre-training with Geometric Context Priors ☆241 Jan 13, 2026 Updated last month
- [CVPR 2024] Generalizable Tumor Synthesis - Realistic Synthetic Tumors in Liver, Pancreas, and Kidney ☆206 Aug 9, 2025 Updated 6 months ago
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma… ☆13 Sep 13, 2024 Updated last year
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging ☆31 Nov 4, 2025 Updated 3 months ago
- [MICCAI 2025 Best Paper Award] Learning Segmentation from Radiology Reports ☆104 Updated this week
- The official code for "Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data". ☆524 Jul 25, 2025 Updated 6 months ago
- [ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition. ☆663 Oct 24, 2025 Updated 3 months ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex… ☆18 Aug 28, 2025 Updated 5 months ago
- Towards Accurate and Lightweight Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis ☆16 Sep 8, 2025 Updated 5 months ago
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation. ☆54 Jan 22, 2026 Updated 3 weeks ago
- [ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes) ☆398 Jan 13, 2026 Updated last month
- ☆69 Feb 3, 2025 Updated last year
- MMLNB: Multi-Modal Learning for Neuroblastoma Subtyping Classification Assisted with Textual Description Generation ☆19 Mar 20, 2025 Updated 10 months ago
- ☆112 Sep 4, 2025 Updated 5 months ago
- [IEEE TMI] Tumor synthesis leveraging medical reports. ☆48 Jan 26, 2026 Updated 3 weeks ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025] ☆42 Jun 29, 2025 Updated 7 months ago
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning ☆48 Dec 21, 2025 Updated last month
- ☆16 Dec 16, 2024 Updated last year
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks ☆27 Oct 28, 2025 Updated 3 months ago
- Pan-Tumor Radiology Foundation Model Utilizing Synthetic Training Data for Advanced Oncological Insights ☆87 Jan 6, 2026 Updated last month
- ☆27 Nov 2, 2023 Updated 2 years ago
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models" ☆154 Jul 7, 2025 Updated 7 months ago