dqshuai / MetaFormer
A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “CoAtNet: Marrying Convolution and Attention for All Data Sizes”
☆242Updated 3 years ago
Alternatives and similar repositories for MetaFormer
Users who are interested in MetaFormer compare it to the libraries listed below.
- PyTorch implementation for "A Novel Plug-in Module for Fine-Grained Visual Classification", targeting the fine-grained visual classification task.☆210Updated 2 years ago
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"☆369Updated last year
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆244Updated 2 years ago
- PyTorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆110Updated 2 years ago
- Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021)☆348Updated 2 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆175Updated 2 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.☆156Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆516Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆234Updated 3 years ago
- This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"☆198Updated 2 years ago
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆288Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks☆480Updated 2 years ago
- Code Release for MViTv2 on Image Recognition.☆443Updated 11 months ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆163Updated 2 years ago
- This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies in…☆164Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆493Updated last year
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆559Updated 3 years ago
- Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"☆212Updated 2 years ago
- Code and models for mobile-former☆130Updated 3 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆142Updated 3 years ago
- [CVPR 2022] BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522 ☆252Updated 2 years ago
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆342Updated last year
- iFormer: Inception Transformer☆246Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆289Updated 3 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…☆279Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆204Updated 3 years ago