damo-cv / KVTLinks
☆36Updated 3 years ago
Alternatives and similar repositories for KVT
Users that are interested in KVT are comparing it to the libraries listed below
Sorting:
- The official implementation for ALOFT (CVPR 2023).☆57Updated 2 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Updated 3 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- ☆32Updated 3 years ago
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆62Updated 3 months ago
- ☆85Updated 2 years ago
- ☆62Updated 3 years ago
- ☆33Updated 4 years ago
- ☆149Updated last year
- A more robust Unsupervised Salient Object Detection (USOD) framework.☆48Updated last year
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆17Updated last year
- Official implement of "CAT: Cross Attention in Vision Transformer".☆167Updated 3 years ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆25Updated 2 years ago
- Dynamic Multi-scale Filters for Semantic Segmentation (DMNet ICCV'2019)☆28Updated 4 years ago
- ☆216Updated 3 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆104Updated 3 years ago
- ☆152Updated last year
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆116Updated 2 years ago
- Official ImageNet Model repository☆260Updated 2 years ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆57Updated last year
- Official implementation of the paper ``W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection"☆29Updated 3 years ago
- How Much Position Information Do Convolutional Neural Networks Encode?☆11Updated 4 years ago
- Scattering Vision Transformer☆53Updated last year
- ☆77Updated 5 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Updated 2 years ago
- ☆118Updated 3 years ago
- ICCV2023论文代码汇总☆18Updated 2 years ago
- ☆31Updated 2 years ago
- Vision Transformers with Hierarchical Attention☆102Updated 3 months ago