dqshuai / MetaFormer
A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".
☆224 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for MetaFormer
- PyTorch implementation of "A Novel Plug-in Module for Fine-Grained Visual Classification" for the fine-grained visual classification task. ☆190 · Updated last year
- ☆244 · Updated last year
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers ☆228 · Updated 2 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers. ☆155 · Updated last year
- PyTorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression" ☆91 · Updated last year
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality" ☆238 · Updated last year
- This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-N… ☆389 · Updated 2 years ago
- Implementation of Convolutional enhanced image Transformer ☆101 · Updated 3 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆484 · Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆281 · Updated 2 years ago
- iFormer: Inception Transformer ☆242 · Updated last year
- This is an official implementation for "Self-Supervised Learning with Swin Transformers". ☆627 · Updated 3 years ago
- [NeurIPS 2022] Official implementation of the paper "Green Hierarchical Vision Transformer for Masked Image Modeling". ☆171 · Updated last year
- PyTorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆425 · Updated last year
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer" ☆330 · Updated 9 months ago
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning ☆284 · Updated 2 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆150 · Updated 2 years ago
- Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection". ☆335 · Updated last year
- Code Release for MViTv2 on Image Recognition. ☆400 · Updated last month
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022]. ☆161 · Updated last year
- ☆210 · Updated 2 years ago
- This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning" ☆193 · Updated last year
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆369 · Updated 2 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers… ☆264 · Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024) ☆421 · Updated 5 months ago
- ☆136 · Updated 2 years ago
- This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies in… ☆152 · Updated last year
- Lite Vision Transformer (CVPR 2022) ☆134 · Updated 2 years ago
- ☆195 · Updated 3 months ago
- Official MegEngine implementation of RepLKNet ☆268 · Updated 2 years ago