youngwanLEE / MPViT
[CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction
☆375Updated 3 years ago
Alternatives and similar repositories for MPViT:
Users that are interested in MPViT are comparing it to the libraries listed below
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆282Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆565Updated last year
- Official MegEngine implementation of RepLKNet☆275Updated 3 years ago
- Lite Vision Transformer (CVPR 2022)☆142Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆230Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆601Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆193Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆242Updated 2 years ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆397Updated 2 years ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆354Updated last year
- [ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".☆506Updated 2 years ago
- ☆215Updated 3 years ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"☆255Updated 3 years ago
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.☆557Updated last year
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆504Updated 2 years ago
- ☆190Updated 2 years ago
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆285Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].☆162Updated last year
- ☆257Updated 2 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆554Updated 3 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆209Updated last year
- PyTorch Implementation of Sparse DETR☆171Updated last year
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆287Updated 3 years ago
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation☆474Updated 3 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆973Updated 2 years ago
- Bottleneck Transformers for Visual Recognition☆278Updated 4 years ago
- Don't feel pain to use Deformable Convolution☆337Updated last year
- ☆194Updated 2 years ago
- PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)☆199Updated last year
- ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)☆207Updated last year