joyfang1106 / MRLALinks
Multi-head Recurrent Layer Attention for Vision Network
☆19Updated 2 years ago
Alternatives and similar repositories for MRLA
Users that are interested in MRLA are comparing it to the libraries listed below
Sorting:
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆49Updated last year
- ☆33Updated 3 years ago
- AFNet(NeurIPS 2022)☆19Updated 2 years ago
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Updated 3 months ago
- ☆27Updated 2 years ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆28Updated last year
- Official PyTorch implementation of "TDAM: Top-down attention module for CNNs"☆11Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆54Updated last year
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆31Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆44Updated last year
- [ECCV2022] Gumbel Optimised Loss for Long Tailed Instance Segmentation.☆18Updated 2 years ago
- Switchable Online Knowledge Distillation☆18Updated 7 months ago
- Offical Code for Paper "Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation"☆17Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Updated 2 years ago
- ☆34Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆53Updated 7 months ago
- Deep Networks with Recurrent Layer Aggregation☆28Updated 3 years ago
- ☆33Updated 7 months ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023)☆32Updated 2 years ago
- ☆28Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆46Updated 7 months ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Updated last year
- ☆23Updated 2 years ago
- ☆44Updated last year
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 3 years ago
- Generating Image Specific Text☆27Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆105Updated last year
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago