mashaan14 / VisionTransformer-MNISTLinks
This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.
☆37Updated 3 weeks ago
Alternatives and similar repositories for VisionTransformer-MNIST
Users that are interested in VisionTransformer-MNIST are comparing it to the libraries listed below
Sorting:
- ☆74Updated 8 months ago
- Visualizing representations with diffusion based conditional generative model.☆95Updated 2 years ago
- ☆184Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆129Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆193Updated 2 years ago
- This repo contains the implementation of VQGAN, Taming Transformers for High-Resolution Image Synthesis in PyTorch from scratch. I have a…☆36Updated 10 months ago
- Personalized Representation from Personalized Generation (ICLR 2025)☆64Updated 3 months ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆155Updated last year
- Simple CIFAR-10 classification with ConvMixer☆45Updated 3 years ago
- ☆50Updated last year
- My take on Flow Matching☆64Updated 5 months ago
- Open source implementation of "Vision Transformers Need Registers"☆182Updated 2 months ago
- Implementation of Diffusion Transformer Model in Pytorch☆61Updated last month
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆148Updated last month
- ☆51Updated last year
- In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results w…☆15Updated last year
- Reproduction of DDPO paper (RLHF for diffusion)☆88Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆102Updated last year
- Code release for "Improved baselines for vision-language pre-training"☆60Updated last year
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆368Updated 7 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 9 months ago
- Simple MAE (masked autoencoders) with pytorch and pytorch-lightning.☆43Updated last year
- Sparse Autoencoders for Stable Diffusion XL models.☆65Updated last week
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆168Updated 2 years ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆44Updated 8 months ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆110Updated 2 months ago
- Autoregressive Image Generation☆32Updated 2 weeks ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆114Updated 2 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆38Updated 2 months ago