ZhuoZHI-UCL / ICL_multimodal
Code for paper 'Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity'
☆12 Updated last year
Alternatives and similar repositories for ICL_multimodal
Users interested in ICL_multimodal are comparing it to the repositories listed below
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings). ☆27 Updated last year
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2… ☆16 Updated 5 months ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022) ☆23 Updated 2 years ago
- ☆22 Updated last year
- Source code for the paper "A Medical Semantic-Assisted Transformer for Radiographic Report Generation" ☆22 Updated last year
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23). ☆54 Updated 7 months ago
- [ACMMM-2022] This is the official implementation of Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Know… ☆38 Updated 2 years ago
- The code for paper: PeFoM-Med: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering ☆46 Updated 3 weeks ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization ☆33 Updated 3 months ago
- A new collection of medical VQA datasets based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr… ☆84 Updated 8 months ago
- [MICCAI'24 Early Accept] Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations ☆14 Updated 11 months ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks ☆31 Updated last month
- Localized questions for VQA ☆11 Updated last week
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models" ☆13 Updated 8 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays ☆31 Updated 5 months ago
- Official implementation of "Calibrating Multimodal Learning" (ICML 2023) ☆21 Updated last year
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding ☆39 Updated 5 months ago
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning ☆75 Updated last month
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering ☆19 Updated 3 weeks ago
- This repository contains the code accompanying the paper "A Self-Guided Framework for Radiology Report Generation", accepted by MICCAI 20… ☆19 Updated last year
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data ☆19 Updated 5 months ago
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing… ☆71 Updated last month
- ☆20 Updated 2 years ago
- The code of EGMA framework. ☆16 Updated 11 months ago
- ☆26 Updated 6 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024 ☆42 Updated 10 months ago
- An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023) ☆31 Updated last year
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding ☆45 Updated last month
- This repository is made for the paper: Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medica… ☆42 Updated 10 months ago
- ☆65 Updated last month