jeongukjae / CLIP-self-attention-visualizationLinks

Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.

☆46

Alternatives and similar repositories for CLIP-self-attention-visualization

Users that are interested in CLIP-self-attention-visualization are comparing it to the libraries listed below

Sorting:

geekyutao / TaskRes
Task Residual for Tuning Vision-Language Models (CVPR 2023)
☆73Updated 2 years ago
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆59Updated 2 years ago
lezhang7 / Enhance-FineGrained
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
☆50Updated 2 months ago
vishaal27 / SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆102Updated last year
mzhaoshuai / RLCF
[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.
☆80Updated 11 months ago
mlvlab / RPO
Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023
☆53Updated last year
vinid / neg_clip
NegCLIP.
☆32Updated 2 years ago
jameelhassan / PromptAlign
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
☆105Updated last year
zycheiheihei / Transferable-Visual-Prompting
[CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…
☆44Updated 6 months ago
allenai / close
☆59Updated last year
UCSC-VLAA / CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆37Updated 2 months ago
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆24Updated 7 months ago
wuw2019 / LoTLIP
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆43Updated 5 months ago
showlab / datacentric.vlp
Compress conventional Vision-Language Pre-training data
☆51Updated last year
ant-research / DreamLIP
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆132Updated last month
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.
☆28Updated last year
Paranioar / UniPT
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
☆67Updated 8 months ago
shubhamprshr27 / NeglectedTailsVLM
This repository houses the code for the paper - "The Neglected of VLMs"
☆28Updated last month
ZhangYuanhan-AI / visual_prompt_retrieval
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
☆177Updated last year
sarahpratt / CuPL
☆191Updated 2 years ago
WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆126Updated 2 months ago
mlvlab / DAPT
Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)
☆40Updated last year
sMamooler / CLIP_Explainability
code for studying OpenAI's CLIP explainability
☆32Updated 3 years ago
deepglint / ALIP
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
☆98Updated last year
miccunifi / Cross-the-Gap
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆47Updated 2 months ago
meetdavidwan / crg
PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"
☆34Updated last year
wjpoom / SPEC
[CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"
☆43Updated last week
NMS05 / Patch-Aligned-Contrastive-Learning
☆22Updated last year
Monoxide-Chen / uncertainty_retrieval
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆72Updated last year
ExplainableML / WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆57Updated last year