jacobgil/vit-explain

Explainability for Vision Transformers

☆1,068

Alternatives and similar repositories for vit-explain

Users who are interested in vit-explain are comparing it to the libraries listed below.

hila-chefer / Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …
☆1,976 · Jan 24, 2024 · Updated 2 years ago
samiraabnar / attention_flow
☆265 · Sep 9, 2021 · Updated 4 years ago
hila-chefer / Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…
☆901 · Aug 24, 2023 · Updated 2 years ago
sayakpaul / probing-vits
Probing the representations of Vision Transformers.
☆339 · Oct 5, 2022 · Updated 3 years ago
jeonsworld / ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,122 · Jun 7, 2022 · Updated 3 years ago
jacobgil / pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…
☆12,643 · Apr 7, 2025 · Updated 10 months ago
facebookresearch / deit
Official DeiT repository
☆4,325 · Mar 15, 2024 · Updated last year
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,459 · Jul 3, 2024 · Updated last year
google-research / vision_transformer
☆12,318 · Jan 30, 2026 · Updated last month
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,397 · Feb 23, 2026 · Updated last week
frgfm / torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-…
☆2,288 · Dec 15, 2025 · Updated 2 months ago
luo3300612 / Visualizer
assistant tools for attention visualization in deep learning
☆1,261 · Jun 9, 2022 · Updated 3 years ago
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,230 · Jul 23, 2024 · Updated last year
dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,566 · Jan 7, 2025 · Updated last year
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,024 · Sep 29, 2022 · Updated 3 years ago
yiyixuxu / TimeSformer-rolled-attention
Visualizing the learned space-time attention using Attention Rollout
☆40 · Apr 1, 2022 · Updated 3 years ago
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,367 · Jun 1, 2024 · Updated last year
lukemelas / PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
☆847 · Mar 2, 2022 · Updated 4 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆15,716 · Jul 24, 2024 · Updated last year
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,300 · Jan 8, 2023 · Updated 3 years ago
xxxnell / how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆823 · Jul 14, 2022 · Updated 3 years ago
hamidkazemi22 / vit-visualization
☆192 · Oct 12, 2023 · Updated 2 years ago
facebookresearch / vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,295 · Mar 3, 2024 · Updated 2 years ago
piotr-komorowski / towards-evaluating-explanations-of-vit
[XAI4CV CVPR 2023] Towards Evaluating Explanations of Vision Transformers for Medical Imaging
☆10 · Dec 1, 2023 · Updated 2 years ago
cmhungsteve / Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
☆5,013 · Jul 30, 2024 · Updated last year
raoyongming / DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆651 · Jul 11, 2023 · Updated 2 years ago
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,340 · Aug 22, 2023 · Updated 2 years ago
openai / CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆32,642 · Feb 18, 2026 · Updated last week
mlfoundations / open_clip
An open source implementation of CLIP.
☆13,430 · Updated this week
AntixK / PyTorch-Model-Compare
Compare neural networks by their feature similarity
☆377 · May 17, 2023 · Updated 2 years ago
google-research / big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,368 · May 19, 2025 · Updated 9 months ago
KaiyangZhou / CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
☆2,179 · May 20, 2024 · Updated last year
whai362 / PVT
Official implementation of PVT series
☆1,887 · Oct 27, 2022 · Updated 3 years ago
Muzammal-Naseer / IPViT
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)
☆183 · Aug 9, 2022 · Updated 3 years ago
facebookresearch / ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
☆1,170 · Jun 17, 2024 · Updated last year
Sara-Ahmed / SiT
Self-supervised vIsion Transformer (SiT)
☆337 · Dec 24, 2022 · Updated 3 years ago
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆12,427 · Updated this week
CompVis / taming-transformers
Taming Transformers for High-Resolution Image Synthesis
☆6,434 · Jul 30, 2024 · Updated last year
facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,297 · Feb 19, 2026 · Updated last week