Yangr116 / ScalableViT
This is the code for the paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"
☆26 Updated last year
Alternatives and similar repositories for ScalableViT:
Users who are interested in ScalableViT compare it to the repositories listed below
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆103 Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆52 Updated 9 months ago
- ☆33 Updated 3 years ago
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024) ☆39 Updated 4 months ago
- A Close Look at Spatial Modeling: From Attention to Convolution ☆91 Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆139 Updated 2 years ago
- ☆72 Updated last month
- (NeurIPS'22) SAPA: Similarity-Aware Point Affiliation for Feature Upsampling ☆36 Updated last year
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer ☆54 Updated 3 years ago
- ☆43 Updated 2 years ago
- ☆130 Updated 2 years ago
- Adaptive Pyramid Context Network for Semantic Segmentation (APCNet CVPR'2019) ☆23 Updated 3 years ago
- PyTorch implementation of PaCa-ViT (CVPR'23) ☆29 Updated last year
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023. ☆69 Updated 2 years ago
- ☆35 Updated 2 years ago
- RF-Next: Efficient Receptive Field Search for CNN(TPAMI2022, CVPR2021) Try it, you wouldn't regret it! ☆63 Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆75 Updated last week
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆28 Updated 2 years ago
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o… ☆39 Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction) ☆63 Updated 2 weeks ago
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023 ☆27 Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation ☆59 Updated 2 months ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023) ☆32 Updated 2 years ago
- One-to-Few Label Assignment for End-to-End Dense Detection (CVPR2023) ☆39 Updated 2 years ago
- ☆57 Updated 3 years ago
- [CVPR 2022] TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing ☆46 Updated 2 years ago
- Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model ☆17 Updated 9 months ago
- ☆59 Updated 3 years ago
- Official implementation of the paper "Unifying Nonlocal Blocks for Neural Networks" (ICCV'21) ☆98 Updated 3 years ago
- ☆73 Updated last year