Yangr116 / ScalableViT
This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"
☆26Updated last year
Alternatives and similar repositories for ScalableViT:
Users that are interested in ScalableViT are comparing it to the libraries listed below
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- ☆128Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆52Updated 8 months ago
- ☆71Updated last week
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆139Updated 2 years ago
- ☆33Updated 3 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆63Updated 2 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆68Updated last year
- ☆43Updated 2 years ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation☆56Updated 5 months ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆54Updated 2 years ago
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆58Updated last month
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 2 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆90Updated last year
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆16Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆47Updated last year
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆26Updated last year
- Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model☆16Updated 9 months ago
- Adaptive Pyramid Context Network for Semantic Segmentation (APCNet CVPR'2019)☆23Updated 3 years ago
- ☆57Updated 3 years ago
- ☆57Updated 2 years ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023)☆32Updated last year
- ☆35Updated last year
- [CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation☆77Updated last year
- Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)☆98Updated 3 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- ☆58Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated 2 years ago
- ☆73Updated last year