damo-cv / KVT
☆31Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for KVT
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆53Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆101Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆44Updated last year
- ☆32Updated 2 years ago
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- ☆58Updated 2 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆92Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆47Updated last year
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated last year
- ☆18Updated last year
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆17Updated last year
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆15Updated 9 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- ☆32Updated last year
- Official implementation of CVPR2024 paper "Enhance Image Classification via Inter-class Image Mixup with Diffusion Model""☆25Updated 2 months ago
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- Official implementation of the paper ``W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection"☆28Updated 2 years ago
- ☆132Updated 2 months ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆115Updated last year
- ☆27Updated 11 months ago
- ☆48Updated 9 months ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆27Updated 6 months ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆47Updated last month
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆68Updated last year
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆94Updated 6 months ago
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆51Updated last year
- ☆33Updated last year
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆47Updated 7 months ago
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆59Updated last year
- ☆22Updated last year