changsn / STViT-R
This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".
☆30Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for STViT-R
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆94Updated 6 months ago
- ☆81Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆101Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆42Updated last year
- Official Implementation of AlignMixup - CVPR 2022☆69Updated 2 years ago
- ☆58Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆64Updated 3 weeks ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆17Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆99Updated last year
- (CVPR2022) Official PyTorch Implementation of KDEP. Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-eff…☆61Updated 2 years ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆30Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated 6 months ago
- ☆32Updated 11 months ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)☆81Updated 2 years ago
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆170Updated last year
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆87Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆52Updated 9 months ago
- ☆29Updated last year
- Official Implementation of "FP-DETR: Detection Transformer Advanced by Fully Pre-training"☆60Updated 2 years ago
- ☆43Updated 6 months ago
- ☆35Updated 2 years ago
- Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)☆160Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆68Updated last year
- AFNet(NeurIPS 2022)☆19Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆92Updated 2 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆69Updated 2 years ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆53Updated 3 weeks ago
- Self-Supervised Video Representation Learning with Motion-Aware Masked Autoencoders☆23Updated 3 months ago
- Code for Part-Guided Relational Transformers for Fine-Grained Visual Recognition, appeared in TIP 2021☆21Updated last year
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆49Updated 2 years ago