RuoyuChen10 / VPSLinks
[CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search
☆28Updated 2 months ago
Alternatives and similar repositories for VPS
Users that are interested in VPS are comparing it to the libraries listed below
Sorting:
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆77Updated 2 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆43Updated 5 months ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆25Updated 3 weeks ago
- Official implement of our work: Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations, whic…☆16Updated last year
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆82Updated last year
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆101Updated last year
- ☆17Updated last year
- ECCV2024: Adversarial Prompt Tuning for Vision-Language Models☆26Updated 6 months ago
- One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models☆51Updated 5 months ago
- ☆41Updated 3 weeks ago
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆45Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- [ECCV 2022] "Adversarial Contrastive Learning via Asymmetric InfoNCE"☆23Updated 2 years ago
- Domain Generalization through Distilling CLIP with Language Guidance☆29Updated last year
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆36Updated 10 months ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆47Updated 10 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆45Updated 2 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆42Updated 11 months ago
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Updated 6 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆35Updated 2 months ago
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆104Updated last week
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆63Updated 4 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆22Updated 8 months ago
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆41Updated last year
- ☆37Updated 10 months ago
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆34Updated 10 months ago
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"☆26Updated 4 months ago
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆50Updated last year
- [ICLR 2025] MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs☆29Updated 2 months ago
- The official repository of ECCV 2024 paper "Outlier-Aware Test-time Adaptation with Stable Memory Replay"☆19Updated last week