DongZhouGu / arxiv-daily
arxiv-daily
☆76 Updated 3 years ago
Alternatives and similar repositories for arxiv-daily:
Users who are interested in arxiv-daily are comparing it to the repositories listed below.
- GroupMixAttention and GroupMixFormer ☆115 Updated last year
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications ☆102 Updated 4 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆226 Updated 9 months ago
- visualization: filter, feature map, attention map, image-mask, grad-cam, human keypoint, guided-backpro ☆108 Updated last year
- ☆83 Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy ☆83 Updated 5 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design ☆87 Updated 8 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆197 Updated last year
- Code Implementation of EfficientVMamba ☆194 Updated 10 months ago
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model ☆19 Updated 11 months ago
- ☆72 Updated last year
- Official repository of MLLA (NeurIPS 2024) ☆271 Updated 2 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete… ☆88 Updated 5 months ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆85 Updated last year
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning ☆39 Updated 6 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆51 Updated 7 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation ☆67 Updated 6 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation ☆26 Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model ☆58 Updated last month
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities ☆98 Updated 11 months ago
- A curated list of papers on the applications of RWKV in computer vision. ☆146 Updated last month
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆185 Updated 6 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆83 Updated last month
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆85 Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆72 Updated 6 months ago
- FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba ☆133 Updated this week
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer" ☆26 Updated last year
- Official Pytorch implementation of Dynamic-Token-Pruning (ICCV2023) ☆19 Updated last year
- ☆137 Updated 11 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆194 Updated 10 months ago