XYPB / CLEFT
Official implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" (MICCAI 2024).
☆16 Updated last month
Alternatives and similar repositories for CLEFT:
Users who are interested in CLEFT are comparing it to the repositories listed below.
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding ☆38 Updated 3 months ago
- [ECCV 2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance ☆13 Updated 6 months ago
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding ☆41 Updated last week
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆70 Updated 7 months ago
- MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks ☆21 Updated 2 months ago
- ☆12 Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners ☆41 Updated last year
- Source code for "MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation", MIDL 2023, https:/… ☆10 Updated last year
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. ☆41 Updated 2 months ago
- The efficient tuning method for VLMs ☆80 Updated last year
- ☆21 Updated 5 months ago
- The repo of ASGMVLP ☆13 Updated 8 months ago
- An official implementation for UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training ☆28 Updated this week
- ☆32 Updated last year
- PyTorch implementation for the CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation ☆36 Updated 2 weeks ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering ☆16 Updated 8 months ago
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023) ☆27 Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training" ☆33 Updated last year
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23" ☆25 Updated last year
- [ICML 2024] The official implementation of SemiRES in PyTorch. ☆24 Updated 8 months ago
- ICLR 2023 and ICML 2023 paper ☆20 Updated 5 months ago
- SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgi… ☆34 Updated 6 months ago
- ☆17 Updated last year
- The official repository of the paper "A Refer-and-Ground Multimodal Large Language Model for Biomedicine" ☆21 Updated 4 months ago
- [CVPR 2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology? ☆30 Updated 4 months ago
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation" ☆47 Updated 5 months ago
- [CVPR 2025] PyTorch implementation of the paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training" ☆24 Updated this week
- Learning Bottleneck Concepts in Image Classification (CVPR 2023) ☆37 Updated last year
- PyTorch code and pretrained weights for the UNIC models. ☆27 Updated 6 months ago
- ☆13 Updated 5 months ago