Qualcomm-AI-research / skip-attentionLinks
☆21Updated last year
Alternatives and similar repositories for skip-attention
Users that are interested in skip-attention are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆29Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆55Updated 2 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Updated 3 years ago
- ☆32Updated 3 years ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆55Updated 10 months ago
- Trainable Highly-expressive Activation Functions. ECCV 2024☆37Updated 6 months ago
- ☆44Updated 2 years ago
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆40Updated last year
- ☆150Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆105Updated 2 years ago
- Implementation for Context-Gated Convolution☆59Updated 3 years ago
- ☆36Updated 2 years ago
- Code for paper " LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform" appeared in ECCV'20☆10Updated 4 years ago
- Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]☆14Updated last year
- ☆56Updated 11 months ago
- Vision Transformers with Hierarchical Attention☆102Updated 11 months ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆44Updated last year
- PyTorch implementation of PaCa-ViT (CVPR'23)☆31Updated 2 years ago
- ☆31Updated 5 months ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆52Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆79Updated 4 months ago
- ☆59Updated 2 years ago
- Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023)☆28Updated last year
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆17Updated last year
- ☆67Updated last year
- Switchable Online Knowledge Distillation☆19Updated 10 months ago
- Implementation of vision transformer. ⭐⭐⭐☆33Updated 3 years ago
- ☆12Updated 8 months ago
- Dynamic Multi-scale Filters for Semantic Segmentation (DMNet ICCV'2019)☆27Updated 4 years ago