ViTAE-Transformer / ViTAE-VSA
The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
☆158Updated 2 years ago
Alternatives and similar repositories for ViTAE-VSA
Users that are interested in ViTAE-VSA are comparing it to the libraries listed below
Sorting:
- The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…☆273Updated 2 years ago
- [CVPR 2023] Referring Image Matting☆208Updated 2 years ago
- A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, …☆230Updated 2 years ago
- Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"☆560Updated 3 years ago
- The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"☆445Updated 2 months ago
- A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, cod…☆476Updated 11 months ago
- Not All Pixels Are Equal: Learning Hardness Probability for Semantic Segmentation.☆36Updated last year
- ☆216Updated 3 years ago
- [CVPR 2023] Explicit Visual Prompting for Low-Level Structure Segmentations☆204Updated last year
- ☆140Updated 10 months ago
- The official pytorch implementation of ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias☆104Updated 3 years ago
- ☆178Updated 4 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆202Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆139Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆156Updated 3 years ago
- ☆142Updated 8 months ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆242Updated 2 years ago
- ☆257Updated 2 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆209Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆277Updated last year
- ☆74Updated last year
- reproduction of semantic segmentation using masked autoencoder (mae)☆162Updated 3 years ago
- ☆131Updated 2 years ago
- ☆146Updated last year
- Official ImageNet Model repository☆250Updated 2 years ago
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆46Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆192Updated 9 months ago
- Code and models for mobile-former☆123Updated 2 years ago
- ☆65Updated 2 years ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆192Updated 3 weeks ago