gyhandy / Hierarchy-CLIP
[CVPR 2023] Improving Zero-shot Generalization and Robustness of Multi-modal Models
☆29 · Updated last year
Related projects:
- ☆25 · Updated 7 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt… ☆26 · Updated 2 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆35 · Updated last month
- Compress conventional Vision-Language Pre-training data ☆49 · Updated 11 months ago
- [ICCV 2023] Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning ☆54 · Updated last month
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners ☆35 · Updated last year
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023) ☆28 · Updated 8 months ago
- [TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level" ☆37 · Updated 4 months ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆62 · Updated 10 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆47 · Updated 9 months ago
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images ☆27 · Updated 9 months ago
- [CVPR 2024] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLA… ☆53 · Updated 3 months ago
- ☆55 · Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023) ☆65 · Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆51 · Updated last month
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023) ☆24 · Updated 8 months ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22 ☆42 · Updated last year
- ☆20 · Updated 11 months ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models ☆44 · Updated 11 months ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆160 · Updated 6 months ago
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models ☆33 · Updated 2 months ago
- ☆21 · Updated 3 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆22 · Updated 3 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆36 · Updated 9 months ago
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation" ☆31 · Updated this week
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models." ☆51 · Updated 2 months ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS) ☆45 · Updated 10 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies. ☆41 · Updated last week
- Official PyTorch implementation of "Masked Images Are Counterfactual Samples for Robust Fine-tuning" (CVPR 2023) ☆12 · Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆37 · Updated 3 weeks ago