Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.
☆50May 11, 2022Updated 4 years ago
Alternatives and similar repositories for CLIP-self-attention-visualization
Users that are interested in CLIP-self-attention-visualization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆13Feb 22, 2025Updated last year
- ☆46Oct 5, 2025Updated 7 months ago
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 23, 2026Updated last month
- Code of paper [CVPR'24: Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?]☆26Apr 2, 2024Updated 2 years ago
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk☆15Mar 15, 2025Updated last year
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆56Aug 16, 2024Updated last year
- 2019년 군 입대 전 캡스톤 디자인 (yolov3)☆20Feb 21, 2023Updated 3 years ago
- ☆54Jul 31, 2022Updated 3 years ago
- [Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆478Mar 1, 2025Updated last year
- This code is for pose-guided human animation from a single image.☆16Jun 18, 2021Updated 4 years ago
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Deep Sensor Fusion with Pyramid Fusion Networks for 3D Semantic Segmentation☆12Apr 13, 2023Updated 3 years ago
- [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling☆11Jan 3, 2023Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆908Aug 24, 2023Updated 2 years ago
- Official implementation of "URECA : Unique Region Caption Anything"☆58Jul 13, 2025Updated 10 months ago
- An effective image quality assessment framework combining Segment Anything (SAM). This is the official implementation of our paper.☆24Jun 29, 2023Updated 2 years ago
- AdaRefSR is a novel reference-based one-step diffusion super-resolution framework. Paper was accepted by ICLR2026.☆52May 19, 2026Updated last week
- ComfyUI Textual Inversion Training nodes using input images from workflow☆13Jul 21, 2025Updated 10 months ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆12Aug 1, 2025Updated 9 months ago
- Glaze is a tool to help artists to prevent their artistic styles from being learned and mimicked by new AI-art models such as MidJourney,…☆35Mar 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- General-purpose Visual Understanding Evaluation☆20Dec 21, 2023Updated 2 years ago
- Create reliability diagrams to quantify ML calibration.☆10Feb 1, 2022Updated 4 years ago
- ViT trained on COYO-Labeled-300M dataset☆33Nov 24, 2022Updated 3 years ago
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆67Apr 4, 2025Updated last year
- ☆13Jan 5, 2022Updated 4 years ago
- This repository contains the official code for the CVPR 2023 paper ``Adversarial Counterfactual Visual Explanations''☆47Mar 12, 2025Updated last year
- ☆673Nov 28, 2023Updated 2 years ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆33Jan 27, 2026Updated 4 months ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- AAAI2025☆13Apr 18, 2025Updated last year
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 11 months ago
- ☆23Sep 28, 2023Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆55Feb 1, 2024Updated 2 years ago
- [CVPR2024] Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation☆19Sep 3, 2024Updated last year
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.☆138Jan 1, 2025Updated last year