mashaan14 / VisionTransformer-MNIST
This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.
☆36Updated 2 months ago
Alternatives and similar repositories for VisionTransformer-MNIST:
Users that are interested in VisionTransformer-MNIST are comparing it to the libraries listed below
- ☆65Updated 6 months ago
- ☆184Updated last year
- Visualizing representations with diffusion based conditional generative model.☆94Updated last year
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆108Updated last year
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- 👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)☆62Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆186Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆123Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 7 months ago
- [NeurIPS 2024] Code for the paper: B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable.☆30Updated last month
- ☆211Updated last year
- Self-Supervised Learning in PyTorch☆136Updated last year
- Generate text captions for images from their embeddings.☆106Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆107Updated 2 weeks ago
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆67Updated 10 months ago
- Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting☆26Updated 10 months ago
- Uncertainty-aware representation learning (URL) benchmark☆102Updated last month
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆87Updated 11 months ago
- Code release for "Improved baselines for vision-language pre-training"☆60Updated 11 months ago
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆42Updated 7 months ago
- Probing the representations of Vision Transformers.☆324Updated 2 years ago
- ☆201Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆102Updated 10 months ago
- DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024)☆29Updated 2 months ago
- Sparse Autoencoders for Stable Diffusion XL models.☆54Updated 2 weeks ago
- ☆18Updated 10 months ago
- Official implementation of MOST: Multiple object localization with self-supervised transformers published at ICCV 2023☆17Updated last year
- Simple MAE (masked autoencoders) with pytorch and pytorch-lightning.☆42Updated last year
- Implementation of diffusion models in pytorch for custom training.☆32Updated 2 years ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆27Updated 3 months ago