xxxnell / how-do-vits-workView external linksLinks
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆824Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for how-do-vits-work
Users that are interested in how-do-vits-work are comparing it to the libraries listed below
Sorting:
- Code release for ConvNeXt model☆6,292Jan 8, 2023Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,365Jun 1, 2024Updated last year
- Learning Features with Parameter-Free Layers, ICLR 2022☆84May 3, 2023Updated 2 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,293Mar 3, 2024Updated last year
- Official DeiT repository☆4,323Mar 15, 2024Updated last year
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,023Sep 29, 2022Updated 3 years ago
- [ICLR2022] official implementation of UniFormer☆896Mar 29, 2024Updated last year
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,175May 15, 2024Updated last year
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,170Jun 17, 2024Updated last year
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆463May 9, 2022Updated 3 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,229Jul 23, 2024Updated last year
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,975Jan 24, 2024Updated 2 years ago
- ☆318Oct 26, 2022Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 3 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,997Mar 21, 2024Updated last year
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆939Apr 24, 2024Updated last year
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,989Jun 16, 2024Updated last year
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,443Jul 3, 2024Updated last year
- Omnivore: A Single Model for Many Visual Modalities☆571Nov 12, 2022Updated 3 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,081Nov 11, 2022Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks☆481Mar 31, 2023Updated 2 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Apr 17, 2022Updated 3 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆114Jan 28, 2026Updated 2 weeks ago
- [NeurIPS 2021] You Only Look at One Sequence☆906May 4, 2022Updated 3 years ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning☆1,550Jan 26, 2026Updated 2 weeks ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Jan 11, 2023Updated 3 years ago
- Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)☆1,012Oct 12, 2022Updated 3 years ago
- [NeurIPS 2022] Official code for "Focal Modulation Networks"☆751Nov 7, 2023Updated 2 years ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆24,993Updated this week
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆523Mar 14, 2023Updated 2 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,450Mar 11, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,351Updated this week
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation☆486Dec 16, 2021Updated 4 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆614Dec 13, 2022Updated 3 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆405Nov 10, 2023Updated 2 years ago