alibaba-damo-academy/fvlm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba-damo-academy/fvlm)

alibaba-damo-academy / fvlm

Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding (ICLR 2025)

☆130

Alternatives and similar repositories for fvlm

Users that are interested in fvlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ibrahimethemhamamci / CT-CLIP
View on GitHub
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography
☆410Jul 18, 2025Updated last year
StanfordMIMI / Merlin
View on GitHub
[Nature 2026] Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured …
☆454May 23, 2026Updated 2 months ago
ibrahimethemhamamci / CT-CHAT
View on GitHub
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography
☆116Oct 15, 2024Updated last year
ibrahimethemhamamci / BTB3D
View on GitHub
[NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging
☆42Nov 4, 2025Updated 8 months ago
MrGiovanni / RadGPT
View on GitHub
[ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …
☆215Dec 31, 2025Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
forithmus / VLM3D-Dockers
View on GitHub
VLM3D: Vision-Language Modeling in 3D Medical Imaging
☆16Jun 6, 2026Updated last month
zhi-xuan-chen / Reg2RG
View on GitHub
This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…
☆73Jun 28, 2025Updated last year
MrGiovanni / R-Super
View on GitHub
[MICCAI 2025 Best Paper Award] Learning Segmentation from Radiology Reports
☆126Jun 29, 2026Updated last month
ZJU4HealthCare / OmniCT
View on GitHub
【ICLR 2026】 Official Repo for Paper ‘’OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis‘’
☆18Mar 4, 2026Updated 4 months ago
mk-runner / Awesome-Radiology-Report-Generation
View on GitHub
paper list, dataset, and tools for radiology report generation
☆467Updated this week
JerrryNie / ConceptCLIP
View on GitHub
☆26Jun 11, 2026Updated last month
project-lighter / CT-FM
View on GitHub
CT-FM: A 3D Image-Based Foundation Model for Computed Tomography
☆70Apr 22, 2026Updated 3 months ago
YichiZhang98 / PET2Rep
View on GitHub
[AAAI'26] PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography
☆24Dec 26, 2025Updated 7 months ago
BAAI-DCAI / M3D
View on GitHub
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
☆454Apr 13, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Luffy03 / Large-Scale-Medical
View on GitHub
[TPAMI 2026] Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
☆283Apr 9, 2026Updated 3 months ago
Luffy03 / FreeTumor
View on GitHub
[Nature Communications 2025] Large-Scale Generative Tumor Synthesis in Computed Tomography Images for Improving Tumor Recognition
☆63Mar 20, 2026Updated 4 months ago
charlierabea / FORTE
View on GitHub
The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"
☆50Apr 24, 2025Updated last year
zch0414 / hlip
View on GitHub
Towards Scalable Language-Image Pre-training for 3D Medical Imaging [TMLR 2026]
☆53Jul 13, 2026Updated 2 weeks ago
Awenbocc / GEMeX-Project
View on GitHub
Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]
☆48Jun 29, 2025Updated last year
mirthAI / Med3DVLM
View on GitHub
☆136May 12, 2026Updated 2 months ago
HINTLab / LeFusion
View on GitHub
[ICLR‘25 Spotlight] LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models
☆149Jun 16, 2026Updated last month
ibrahimethemhamamci / GenerateCT
View on GitHub
ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
☆194Jul 3, 2024Updated 2 years ago
Schuture / DeepTumorVQA
View on GitHub
DeepTumorVQA benchmark for VLMs and Agents (10k testing samples)
☆40May 19, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
uni-medical / Project-Imaging-X
View on GitHub
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
☆468Apr 3, 2026Updated 3 months ago
MrGiovanni / ScaleMAI
View on GitHub
☆24Jan 11, 2025Updated last year
zhaoziheng / SAT
View on GitHub
[npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"
☆307Dec 29, 2025Updated 7 months ago
Tang-xiaoxiao / 3D-RAD
View on GitHub
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆34Jun 22, 2026Updated last month
YuliWanghust / BrainMD
View on GitHub
[NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection
☆24Mar 25, 2026Updated 4 months ago
MrGiovanni / DiffTumor
View on GitHub
[CVPR 2024] Generalizable Tumor Synthesis - Realistic Synthetic Tumors in Liver, Pancreas, and Kidney
☆219Feb 18, 2026Updated 5 months ago
Leevan001 / MedReason-R1
View on GitHub
MEDREASON-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom
☆16Oct 10, 2025Updated 9 months ago
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 8 months ago
ibrahimethemhamamci / CT2Rep
View on GitHub
MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging
☆126Jul 1, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MrGiovanni / TextoMorph
View on GitHub
[IEEE TMI] Tumor synthesis leveraging medical reports.
☆49Jan 26, 2026Updated 6 months ago
MedAIerHHL / CVPR-MIA
View on GitHub
Papers of Medical Image Analysis on CVPR
☆496Jun 20, 2025Updated last year
jinlab-imvr / 3DMedAgent
View on GitHub
[2026 ICML] 3DMedAgent: Unified Perception-to-Understanding for 3D Medical Analysis
☆26May 25, 2026Updated 2 months ago
Luffy03 / GF-Screen
View on GitHub
[ICLR 2026] Glance and Focus Reinforcement for Pan-cancer Screening
☆36May 14, 2026Updated 2 months ago
tangyuhao2016 / CTRG
View on GitHub
☆19Aug 21, 2023Updated 2 years ago
MrGiovanni / SMILE
View on GitHub
☆45Jan 26, 2026Updated 6 months ago
Stanford-AIMI / GREEN
View on GitHub
[EMNLP, Findings 2024] a radiology report generation metric that leverages the natural language understanding of language models to ident…
☆85Sep 9, 2025Updated 10 months ago