changsn / STViT-RLinks
This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".
☆34Updated 3 months ago
Alternatives and similar repositories for STViT-R
Users that are interested in STViT-R are comparing it to the libraries listed below
Sorting:
- ☆85Updated last year
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆101Updated last year
- ☆65Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆105Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆55Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆142Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆77Updated 5 months ago
- ☆28Updated last year
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆95Updated last month
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆75Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆49Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated 7 months ago
- ☆34Updated last year
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆59Updated 4 months ago
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 2 years ago
- ☆60Updated 2 years ago
- Content-aware Token Sharing applied to Segmenter☆21Updated 2 years ago
- Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.☆139Updated 2 years ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)☆81Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆64Updated 2 months ago
- [CVPR2022] PyTorch implementation of ''Background Activation Suppression for Weakly Supervised Object Localization''.☆45Updated last year
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated last month
- ☆22Updated 2 years ago
- Official Implementation of "FP-DETR: Detection Transformer Advanced by Fully Pre-training"☆61Updated 3 years ago
- ☆142Updated 11 months ago
- [CVPR'23] Official PyTorch implementation of Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification…☆42Updated last year