DongZhouGu / arxiv-dailyLinks
arxiv-daily
☆80Updated 4 years ago
Alternatives and similar repositories for arxiv-daily
Users that are interested in arxiv-daily are comparing it to the libraries listed below
Sorting:
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆121Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated 2 months ago
- ☆75Updated last year
- A list of papers, codes and applications on multi-task learning.☆72Updated 2 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆28Updated last year
- ☆82Updated 9 months ago
- ☆84Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆77Updated last week
- GroupMixAttention and GroupMixFormer☆116Updated last year
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆95Updated last month
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆64Updated 3 weeks ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆249Updated last year
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆72Updated 9 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆105Updated 9 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆91Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆92Updated 11 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆35Updated 6 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆53Updated 7 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆49Updated 2 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆104Updated 11 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆28Updated 2 months ago
- autoupdate paper list☆84Updated this week
- Official Pytorch implementation of Dynamic-Token-Pruning (ICCV2023)☆21Updated last year
- Simba☆207Updated last year
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆86Updated last year
- ☆130Updated 2 years ago
- ☆44Updated 3 months ago
- Code Implementation of EfficientVMamba☆211Updated last year
- ☆41Updated 7 months ago