dqshuai / MetaFormer
A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “CoAtNet: Marrying Convolution and Attention for All Data Sizes”
☆232Updated 2 years ago
Alternatives and similar repositories for MetaFormer:
Users that are interested in MetaFormer are comparing it to the libraries listed below
- Pytorch implementation for "A Novel Plug-in Module for Fine-Grained Visual Classification". fine-grained visual classification task.☆193Updated last year
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆243Updated 2 years ago
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆93Updated last year
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆345Updated last year
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆286Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆493Updated last year
- ☆249Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆231Updated 3 years ago
- ☆197Updated 7 months ago
- Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".☆334Updated last year
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.☆155Updated 2 years ago
- This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-N…☆394Updated 2 years ago
- Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021)☆326Updated 2 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆173Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆444Updated 9 months ago
- Code Release for MViTv2 on Image Recognition.☆418Updated 3 months ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆161Updated last year
- ☆138Updated 3 years ago
- iFormer: Inception Transformer☆245Updated 2 years ago
- [CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction☆370Updated 3 years ago
- ☆214Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆340Updated last year
- ☆191Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆282Updated 2 years ago
- Self-supervised vIsion Transformer (SiT)☆326Updated 2 years ago
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆342Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆427Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆195Updated 2 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆569Updated last year
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆643Updated 3 years ago