mbzuai-oryx / MIRALinks
[ACM MM 2025 π₯π₯ ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic context control to boost factual accuracy in multimodal medical reasoning.
β16Updated 3 weeks ago
Alternatives and similar repositories for MIRA
Users that are interested in MIRA are comparing it to the libraries listed below
Sorting:
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)β12Updated last year
- β19Updated 4 months ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).β28Updated 3 months ago
- β21Updated 4 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimizationβ53Updated 3 months ago
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2β¦β18Updated 9 months ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answeringβ21Updated 4 months ago
- β20Updated 2 years ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Dataβ22Updated 3 months ago
- β40Updated 10 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"β22Updated 7 months ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learningβ25Updated 5 months ago
- The repo of ASGMVLPβ17Updated last year
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).β55Updated 11 months ago
- Source code for the paper "A Medical Semantic-Assisted Transformer for Radiographic Report Generation"β25Updated 2 years ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'β29Updated 10 months ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).β23Updated 2 months ago
- MedVLThinker: Simple Baselines for Multimodal Medical Reasoningβ34Updated last month
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]β27Updated 2 months ago
- [ACMMM-2022] This is the official implementation of Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowβ¦β38Updated 2 years ago
- MC-CoT implementation codeβ19Updated 2 months ago
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approachβ14Updated last month
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)β23Updated 2 years ago
- Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathyβ15Updated 2 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MICβ¦β17Updated 7 months ago
- [ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Promptsβ74Updated last year
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learningβ89Updated 2 months ago
- Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generationβ16Updated 2 years ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasksβ40Updated 2 months ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electrβ¦β88Updated last year