XYPB / CLEFTLinks

Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MICCAI 2024.

☆17

Alternatives and similar repositories for CLEFT

Users that are interested in CLEFT are comparing it to the libraries listed below

Sorting:

CUHK-AIM-Group / MCPL
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Updated last year
Tang-xiaoxiao / 3D-RAD
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆20Updated last month
LeapLabTHU / CheXWorld
[CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
☆29Updated 7 months ago
minghu0830 / OphNet-benchmark
[ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"
☆57Updated 4 months ago
richard-peng-xia / HGCLIP
[COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
☆43Updated last year
JerrryNie / ConceptCLIP
☆21Updated 6 months ago
UCSC-VLAA / MedVLThinker
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
☆38Updated 3 weeks ago
mbzuai-oryx / MIRA
[ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…
☆17Updated 3 months ago
zhaoziheng / OmniAbnorm-CT
Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach
☆17Updated last week
keke-nice / MedTVT-R1
☆23Updated 2 months ago
Harvard-Ophthalmology-AI-Lab / FairCLIP
[CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning
☆94Updated 4 months ago
yhygao / Explicd
☆16Updated last year
pumpkin805 / FALIP
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆17Updated last year
mzhaoshuai / RLCF
[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.
☆95Updated last month
FereshteShakeri / few-shot-MedVLMs
☆32Updated last year
ASGMVLP / ASGMVLP_CODE
The repo of ASGMVLP
☆17Updated last year
ShawnHuang497 / BiRD
The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'
☆32Updated last year
longbai1006 / CAT-ViL
Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…
☆16Updated last year
MSIIP / Uni-Med
☆44Updated 2 weeks ago
CAMMA-public / SSG-VQA
[IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge
☆45Updated 6 months ago
visiondom / CHRF
☆19Updated 2 years ago
LinjieMu / MMXU
☆19Updated last month
icon-lab / MedTrim
Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"
☆12Updated 8 months ago
MIV-XJTU / FLAME
[CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"
☆32Updated 4 months ago
XuMengyaAmy / SwinMLP_TranCAP
☆13Updated 3 years ago
eric-ai-lab / ProbMed
[ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"
☆24Updated 9 months ago
nhussein / promptsmooth
Official implementation of the paper "PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning"
☆23Updated 7 months ago
aiming-lab / MMedPO
[ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
☆62Updated 5 months ago
SooLab / DDCOT
[NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
☆49Updated last year
AI-in-Health / M3FM
[npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis
☆16Updated 9 months ago