zer0int / CLIP-text-image-interpretability

Get CLIP ViT text tokens about an image, visualize attention as a heatmap.
10Updated last year

Alternatives and similar repositories for CLIP-text-image-interpretability:

Users that are interested in CLIP-text-image-interpretability are comparing it to the libraries listed below