mashaan14 / VisionTransformer-MNIST
This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.
☆36 · Updated last week
Alternatives and similar repositories for VisionTransformer-MNIST
Users who are interested in VisionTransformer-MNIST are comparing it to the libraries listed below.
- ☆74 · Updated 7 months ago
- Self-Supervised Learning in PyTorch ☆138 · Updated last year
- Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting ☆28 · Updated 11 months ago
- Visualizing representations with diffusion based conditional generative model. ☆95 · Updated 2 years ago
- ☆183 · Updated last year
- DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024) ☆31 · Updated 3 months ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?" ☆111 · Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision. ☆192 · Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models. ☆127 · Updated last year
- This repo contains the implementation of VQGAN, Taming Transformers for High-Resolution Image Synthesis in PyTorch from scratch. I have a… ☆35 · Updated 9 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆113 · Updated last month
- Simple MAE (masked autoencoders) with pytorch and pytorch-lightning. ☆43 · Updated last year
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024 ☆66 · Updated 11 months ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers ☆55 · Updated last year
- This is the official code release for our work, Denoising Vision Transformers. ☆361 · Updated 6 months ago
- This repo implements VQVAE on MNIST as well as a colored version of MNIST images. It also implements simple LSTM for generating sample … ☆56 · Updated last year
- [ICML 2023] official implementation for "Input Perturbation Reduces Exposure Bias in Diffusion Models" ☆116 · Updated 2 months ago
- Implementation of diffusion models in pytorch for custom training. ☆32 · Updated 2 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?" ☆154 · Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 · Updated 8 months ago
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆167 · Updated last year
- Personalized Representation from Personalized Generation (ICLR 2025) ☆64 · Updated 3 months ago
- My take on Flow Matching ☆57 · Updated 4 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆178 · Updated 2 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆88 · Updated last year
- A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch ☆31 · Updated last year
- ☆18 · Updated 11 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆101 · Updated last year
- Sparse Autoencoders for Stable Diffusion XL models. ☆63 · Updated 3 weeks ago
- (ICLR 2024) Code Release for Patch-DM ☆45 · Updated last year