ViTAE-Transformer / ViTAE-VSALinks
The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"
☆157Updated last month
Alternatives and similar repositories for ViTAE-VSA
Users that are interested in ViTAE-VSA are comparing it to the libraries listed below
Sorting:
- The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…☆279Updated last month
- A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, …☆231Updated 2 years ago
- Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"☆575Updated 3 years ago
- [CVPR 2023] Referring Image Matting☆208Updated 2 years ago
- A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, cod…☆482Updated last year
- The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"☆455Updated 8 months ago
- [ACM MM 2021] Privacy-Preserving Portrait Matting☆306Updated 2 years ago
- Not All Pixels Are Equal: Learning Hardness Probability for Semantic Segmentation.☆36Updated 2 years ago
- [IJCAI'21] Deep Automatic Natural Image Matting☆400Updated 2 years ago
- ☆216Updated 3 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆206Updated last year
- ☆31Updated last year
- [CVPR 2023 & TPAMI 2025] Explicit Visual Prompting for Low-Level Structure Segmentations☆215Updated last week
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆211Updated 2 years ago
- ☆185Updated 9 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆289Updated last year
- A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.☆134Updated last year
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆217Updated 4 months ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆244Updated 2 years ago
- This is the pytorch implementation of FCL-Net, accepted by NN'2022.☆14Updated 3 years ago
- ☆148Updated last year
- Official repository of Slide-Transformer (CVPR2023)☆172Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆142Updated 3 years ago
- ☆70Updated 2 years ago
- [NeurIPS2024] Official Pytorch Implementation of SSA-Seg☆45Updated 11 months ago
- The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretatio…☆53Updated 11 months ago
- ☆133Updated 2 years ago
- A more robust Unsupervised Salient Object Detection (USOD) framework.☆48Updated last year
- Concealed Scene Understanding, Visual Intelligence (VI), 2023☆70Updated 2 months ago
- ☆61Updated 3 years ago