This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Report Generation".
β68Jun 28, 2025Updated 8 months ago
Alternatives and similar repositories for Reg2RG
Users that are interested in Reg2RG are comparing it to the libraries listed below
Sorting:
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approachβ19Nov 17, 2025Updated 3 months ago
- [ π― NeurIPS 2025 ] 3D-RAD π©»: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasksβ27Oct 28, 2025Updated 4 months ago
- [ACL 2025] βοΈ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,β¦β28Jan 10, 2026Updated last month
- β36Jan 9, 2026Updated 2 months ago
- Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generationβ18Nov 13, 2025Updated 3 months ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomographyβ91Oct 15, 2024Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Geβ¦β11Jul 28, 2025Updated 7 months ago
- MICCAI 2024 & CT2Rep: Automated Radiology Report Generation for 3D Medical Imagingβ118Jul 1, 2024Updated last year
- KAIST medical VL research groupβ20Dec 20, 2024Updated last year
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]β42Jun 29, 2025Updated 8 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".β49Jan 6, 2026Updated 2 months ago
- [CVPR'25] Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generationβ84Feb 8, 2026Updated last month
- Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL β¦β24Jul 12, 2025Updated 7 months ago
- β18Aug 21, 2023Updated 2 years ago
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Modelsβ111Jul 7, 2025Updated 8 months ago
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2β¦β17Dec 11, 2024Updated last year
- [NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicineβ28Mar 10, 2025Updated 11 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These βsuperhumanβ reports are more accurate, detailed, standardized, β¦β199Dec 31, 2025Updated 2 months ago
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)β23Mar 6, 2025Updated last year
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasksβ45Oct 18, 2025Updated 4 months ago
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancementβ35Oct 18, 2025Updated 4 months ago
- β45Mar 2, 2026Updated last week
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"β25Feb 21, 2025Updated last year
- (AAAI-2025 oral) LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contextsβ52Jun 12, 2025Updated 8 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".β23Sep 19, 2024Updated last year
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)β24Feb 11, 2026Updated 3 weeks ago
- Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Imagesβ11Jan 11, 2024Updated 2 years ago
- β202Sep 22, 2025Updated 5 months ago
- β40Mar 15, 2023Updated 2 years ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Modelsβ424Apr 13, 2025Updated 10 months ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomographyβ361Jul 18, 2025Updated 7 months ago
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"β46Apr 24, 2025Updated 10 months ago
- Offical code of Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training[ICML 2024]β24May 31, 2024Updated last year
- LLaVa Version of RaDialogβ26May 27, 2025Updated 9 months ago
- [MICCAIβ25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessmentβ17Feb 27, 2026Updated last week
- β11Jun 21, 2025Updated 8 months ago
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imagingβ32Nov 4, 2025Updated 4 months ago
- β32Oct 6, 2024Updated last year
- [MICCAI 2025 Best Paper Award] Learning Segmentation from Radiology Reportsβ110Feb 28, 2026Updated last week