mashaan14 / VisionTransformer-MNISTLinks
This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.
☆38Updated 6 months ago
Alternatives and similar repositories for VisionTransformer-MNIST
Users that are interested in VisionTransformer-MNIST are comparing it to the libraries listed below
Sorting:
- [ICCV25] Official Implementation of LeGrad☆83Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆196Updated 2 years ago
- Visualizing representations with diffusion based conditional generative model.☆103Updated 2 years ago
- ☆190Updated 2 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆149Updated 2 months ago
- Code for Principal Masked Autoencoders☆30Updated 8 months ago
- ☆107Updated 8 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆128Updated 8 months ago
- My take on Flow Matching☆86Updated 11 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆42Updated 8 months ago
- ConceptAttention: A method for interpreting multi-modal diffusion transformers.☆354Updated last month
- Self-Supervised Learning in PyTorch☆142Updated last year
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆113Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆124Updated 8 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆109Updated last year
- DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements …☆328Updated last year
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆190Updated 7 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- Open source implementation of "Vision Transformers Need Registers"☆201Updated last month
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024☆144Updated last year
- Sparse Autoencoders for Stable Diffusion XL models.☆79Updated last month
- ☆26Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023☆188Updated 2 years ago
- Sparse Linear Concept Embeddings☆126Updated 8 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆154Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆224Updated last year
- ☆56Updated 2 years ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆357Updated last week
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆47Updated last year
- Learning from synthetic data - code and models☆325Updated last year