dc3ea9f / RegionViTLinks
Unofficial implementation for "RegionViT: Regional-to-Local Attention for Vision Transformers"
☆10Updated 4 years ago
Alternatives and similar repositories for RegionViT
Users that are interested in RegionViT are comparing it to the libraries listed below
Sorting:
- The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration☆84Updated 3 years ago
- Official Codes and Pretrained Models for Dynamic MLP, CVPR2022, https://arxiv.org/abs/2203.03253☆88Updated 3 years ago
- ☆25Updated 4 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Updated 3 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆118Updated 3 years ago
- ☆72Updated 10 months ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 4 years ago
- ☆59Updated 3 years ago
- code base for vision transformers☆36Updated 4 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Updated 7 months ago
- The official PyTorch implementation of oral paper "FocusCut: Diving into a Focus View in Interactive Segmentation" in CVPR 2022.☆29Updated 2 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆61Updated 2 years ago
- This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?"☆98Updated 4 years ago
- The official pytorch implementation of ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias☆104Updated 3 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆103Updated 2 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Updated 3 years ago
- [ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation☆56Updated 3 years ago
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆30Updated 2 years ago
- Open Source Neural Architecture Search Toolbox for Device-aware Image Dense Prediction & Official implementation of ICCV2021 "iNAS: Integ…☆84Updated 3 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆66Updated 9 months ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆54Updated 3 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆158Updated 4 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆153Updated 4 years ago
- Pytorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409☆30Updated 3 years ago
- ☆27Updated 3 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆95Updated 3 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 3 years ago
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers"☆77Updated last year
- A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"☆58Updated 4 years ago