heliossun / STLLaVA-Med
Self-training LLaVA for medical
☆16Updated 4 months ago
Alternatives and similar repositories for STLLaVA-Med:
Users who are interested in STLLaVA-Med are comparing it to the repositories listed below
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆11Updated 7 months ago
- The efficient tuning method for VLMs☆80Updated 11 months ago
- [NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models☆38Updated 11 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆38Updated 2 months ago
- [ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition☆50Updated 6 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- Code release for "CLIPood: Generalizing CLIP to Out-of-Distributions" (ICML 2023), https://arxiv.org/abs/2302.00864☆64Updated last year
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆41Updated 8 months ago
- MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆24Updated 3 weeks ago
- Code for our NeurIPS'24 paper☆21Updated 4 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆47Updated 4 months ago
- ☆26Updated last year
- This repository houses the code for the paper - "The Neglected of VLMs"☆27Updated 2 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆75Updated 2 weeks ago
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆27Updated last year
- Towards Unified and Effective Domain Generalization☆30Updated last year
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆37Updated 3 months ago
- Domain Generalization through Distilling CLIP with Language Guidance☆27Updated last year
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆66Updated 9 months ago
- [TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆39Updated 10 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆70Updated 7 months ago
- Visual self-questioning for large vision-language assistant.☆40Updated 5 months ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆13Updated last year
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆36Updated 7 months ago
- [ICCV 2023] Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & [IJCV 2025] Diffusion-Enhanced Test-time Adap…☆61Updated last month
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆80Updated 10 months ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆45Updated 7 months ago
- [CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning☆21Updated last year
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning☆47Updated last month
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆87Updated last year