CUHK-AIM-Group / MCPLLinks
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆12Updated last year
Alternatives and similar repositories for MCPL
Users that are interested in MCPL are comparing it to the libraries listed below
Sorting:
- ☆21Updated 6 months ago
- ☆43Updated last week
- The repo of ASGMVLP☆17Updated last year
- ☆19Updated last month
- ☆32Updated last year
- Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy☆15Updated 2 years ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data☆23Updated 5 months ago
- Source code for the paper "A Medical Semantic-Assisted Transformer for Radiographic Report Generation"☆25Updated 2 years ago
- Rethinking Data Perturbation and Model Stabilization for Semi-supervised Medical Image Segmentation☆14Updated 2 years ago
- Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation☆16Updated last week
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆31Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆57Updated 4 months ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆12Updated 8 months ago
- This repository contains the code accompanying the paper "A Self-Guided Framework for Radiology Report Generation", accepted by MICCAI 20…☆20Updated last year
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆17Updated 2 months ago
- ☆22Updated 2 years ago
- ☆20Updated 10 months ago
- This is the repository for the ICLR2023 accepted paper -- Medical Image Understanding With Pretrained VLM☆31Updated 2 years ago
- [CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?☆30Updated last year
- ☆25Updated last year
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆43Updated last month
- Offical code of Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training[ICML 2024]☆24Updated last year
- ☆24Updated last year
- ☆28Updated 7 months ago
- ☆18Updated last year
- ☆22Updated last year
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆15Updated last year
- Multi-Aspect Vision Language Pretraining - CVPR2024☆84Updated last year
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆40Updated 5 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆20Updated 3 weeks ago