DongZhouGu / arxiv-daily
arxiv-daily
☆79Updated 3 years ago
Alternatives and similar repositories for arxiv-daily:
Users that are interested in arxiv-daily are comparing it to the libraries listed below
- GroupMixAttention and GroupMixFormer☆116Updated last year
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆86Updated last year
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆110Updated 7 months ago
- ☆84Updated last year
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆98Updated 10 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated last month
- A curated list of papers on the applications of RWKV in computer vision.☆171Updated 2 weeks ago
- Official repository of Slide-Transformer (CVPR2023)☆169Updated 8 months ago
- A list of papers, codes and applications on multi-task learning.☆71Updated last month
- Official repository of MLLA (NeurIPS 2024)☆320Updated 5 months ago
- Code Implementation of EfficientVMamba☆207Updated last year
- ☆131Updated 2 years ago
- ☆79Updated 8 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆191Updated 9 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆247Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆104Updated 8 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆91Updated 8 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆27Updated last year
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding☆209Updated last year
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆211Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆53Updated 10 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆202Updated last year
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year
- The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, published in ICML 2024)☆100Updated 10 months ago
- ☆66Updated 2 years ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆40Updated 6 months ago
- ☆142Updated 8 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"