changsn / STViT-R
This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".
☆33Updated last month
Alternatives and similar repositories for STViT-R:
Users that are interested in STViT-R are comparing it to the libraries listed below
- ☆85Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 10 months ago
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆99Updated 10 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- ☆27Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆34Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆46Updated last year
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆34Updated last year
- ☆33Updated last year
- ☆47Updated 2 years ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆65Updated 2 months ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆94Updated 2 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 5 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆134Updated 4 months ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆53Updated last year
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆178Updated last year
- Novel Class Discovery in Semantic Segmentation. CVPR 2022☆68Updated 2 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆138Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆90Updated last year
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)☆82Updated 2 years ago
- ☆63Updated 2 years ago
- ☆44Updated 10 months ago
- Official Implementation of AlignMixup - CVPR 2022☆72Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆29Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- ☆37Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆41Updated 5 months ago
- AFNet(NeurIPS 2022)☆19Updated 2 years ago