joyfang1106 / MRLA
Multi-head Recurrent Layer Attention for Vision Network
☆19Updated 2 years ago
Alternatives and similar repositories for MRLA:
Users that are interested in MRLA are comparing it to the libraries listed below
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Updated 2 months ago
- Official PyTorch implementation of "TDAM: Top-down attention module for CNNs"☆11Updated 2 years ago
- ☆34Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆48Updated last year
- Deep Networks with Recurrent Layer Aggregation☆28Updated 3 years ago
- AFNet(NeurIPS 2022)☆19Updated 2 years ago
- ☆33Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆44Updated 6 months ago
- The official implementation for ALOFT (CVPR 2023).☆54Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- [ECCV 2024] Official code release for "Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition"☆26Updated last month
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆27Updated last year
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆31Updated 2 years ago
- TCPNet☆30Updated 3 years ago
- A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆18Updated 3 years ago
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆20Updated 9 months ago
- ☆32Updated 6 months ago
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆29Updated last year
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- ☆22Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆75Updated last year
- ☆16Updated 3 years ago
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆16Updated last year
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆35Updated last year
- This is the implementation of our AURL paper "Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification".☆15Updated 2 years ago
- Switchable Online Knowledge Distillation☆18Updated 5 months ago
- Generating Image Specific Text☆27Updated last year
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago