ViTAE-Transformer / ViTAE-VSALinks
The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
☆158Updated 2 months ago
Alternatives and similar repositories for ViTAE-VSA
Users that are interested in ViTAE-VSA are comparing it to the libraries listed below
Sorting:
- The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…☆282Updated 2 months ago
- A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, …☆231Updated 2 years ago
- Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"☆580Updated 3 years ago
- [CVPR 2023] Referring Image Matting☆208Updated 2 years ago
- A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, cod…☆482Updated last year
- The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"☆457Updated 9 months ago
- [ACM MM 2021] Privacy-Preserving Portrait Matting☆308Updated 2 years ago
- Not All Pixels Are Equal: Learning Hardness Probability for Semantic Segmentation.☆36Updated 2 years ago
- ☆216Updated 3 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆218Updated 5 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆291Updated 2 years ago
- ☆186Updated 11 months ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆403Updated 3 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆213Updated 2 years ago
- ☆132Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆143Updated 3 years ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆264Updated 2 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆291Updated 3 years ago
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated 2 years ago
- [CVPR 2022] Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers☆222Updated 3 years ago
- A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.☆138Updated last year
- ☆68Updated 2 years ago
- ☆61Updated 2 years ago
- This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"☆255Updated 4 years ago
- ☆196Updated 3 years ago
- ☆72Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆244Updated 3 years ago
- This is the code related to "Semantic-Aware Domain Generalized Segmentation" (CVPR 2022)☆157Updated 3 years ago
- Code and models for mobile-former☆131Updated 3 years ago
- Official ImageNet Model repository☆258Updated 2 years ago