mashaan14 / VisionTransformer-MNIST
This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.
☆38 Updated 7 months ago
Alternatives and similar repositories for VisionTransformer-MNIST
Users who are interested in VisionTransformer-MNIST are comparing it to the repositories listed below.
- [ICCV25] Official Implementation of LeGrad ☆86 Updated last year
- Self-Supervised Learning in PyTorch ☆143 Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision. ☆196 Updated 2 years ago
- Visualizing representations with diffusion based conditional generative model. ☆103 Updated 2 years ago
- ☆190 Updated 2 years ago
- ☆56 Updated 2 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers" ☆161 Updated 3 months ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆120 Updated 2 years ago
- Timm model explorer ☆42 Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆102 Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models. ☆155 Updated last year
- This repo contains the implementation of VQGAN, Taming Transformers for High-Resolution Image Synthesis in PyTorch from scratch. I have a… ☆39 Updated last year
- Contrastive Reinforcement Learning ☆56 Updated last week
- A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models' ☆185 Updated 3 years ago
- VQ-VAE/GAN implementation in pytorch-lightning ☆50 Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024 ☆110 Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆194 Updated 2 years ago
- A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch ☆41 Updated last year
- Sparse Linear Concept Embeddings ☆127 Updated 9 months ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?" ☆166 Updated 2 years ago
- ☆210 Updated 2 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?" ☆114 Updated last year
- My take on Flow Matching ☆89 Updated last year
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models ☆85 Updated 5 months ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024 ☆144 Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆129 Updated 9 months ago
- ☆57 Updated last year
- This is the official code release for our work, Denoising Vision Transformers. ☆392 Updated last year
- ConceptAttention: A method for interpreting multi-modal diffusion transformers. ☆410 Updated last week
- Rebuild the Stable Diffusion Model in a single python script. Tutorial for Harvard ML from Scratch Series ☆222 Updated 11 months ago