(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆821Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for how-do-vits-work
Users that are interested in how-do-vits-work are comparing it to the libraries listed below.
- Code release for ConvNeXt model☆6,326Jan 8, 2023Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,366Jun 1, 2024Updated last year
- Learning Features with Parameter-Free Layers, ICLR 2022☆84May 3, 2023Updated 2 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Mar 3, 2024Updated 2 years ago
- Official DeiT repository☆4,327Mar 15, 2024Updated 2 years ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,176May 15, 2024Updated last year
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,174Jun 17, 2024Updated last year
- PyTorch implementation of MAE https://arxiv.org/abs/2111.06377☆8,243Jul 23, 2024Updated last year
- [ICLR2022] official implementation of UniFormer☆898Mar 29, 2024Updated last year
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,030Sep 29, 2022Updated 3 years ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,984Jan 24, 2024Updated 2 years ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆941Apr 24, 2024Updated last year
- Official code for Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆464May 9, 2022Updated 3 years ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,485Jul 3, 2024Updated last year
- FFCV: Fast Forward Computer Vision (and other ML workloads!)☆2,986Jun 16, 2024Updated last year
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,995Mar 21, 2024Updated 2 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,081Nov 11, 2022Updated 3 years ago
- ☆318Oct 26, 2022Updated 3 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,538Mar 18, 2026Updated last week
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆523Mar 14, 2023Updated 3 years ago
- Code release for SLIP: Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆115Mar 13, 2024Updated 2 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Apr 17, 2022Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks☆481Mar 31, 2023Updated 2 years ago
- Omnivore: A Single Model for Many Visual Modalities☆572Nov 12, 2022Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Jan 11, 2023Updated 3 years ago
- Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)☆1,012Oct 12, 2022Updated 3 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,782Jul 24, 2024Updated last year
- [NeurIPS 2022] Official code for "Focal Modulation Networks"☆750Nov 7, 2023Updated 2 years ago
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆378Sep 16, 2022Updated 3 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,450Mar 11, 2022Updated 4 years ago
- solo-learn: a library of self-supervised methods for visual representation learning powered by PyTorch Lightning☆1,553Mar 16, 2026Updated last week
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆292Apr 25, 2022Updated 3 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆5,024Jul 30, 2024Updated last year
- [NeurIPS 2021] You Only Look at One Sequence☆908May 4, 2022Updated 3 years ago
- iFormer: Inception Transformer☆248Jan 14, 2023Updated 3 years ago