OliverRensu / ARM
This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆69Updated 8 months ago
Alternatives and similar repositories for ARM:
Users that are interested in ARM are comparing it to the libraries listed below
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆90Updated 8 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 6 months ago
- ☆56Updated this week
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆73Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 10 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆90Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆80Updated 11 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆79Updated 6 months ago
- ☆122Updated 8 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆74Updated 6 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆98Updated 4 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆80Updated last year
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆33Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆56Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆52Updated last year
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆79Updated 11 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆100Updated 2 months ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆103Updated last year
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆65Updated 2 weeks ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 10 months ago
- ☆32Updated last year
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆77Updated 3 weeks ago
- ☆81Updated last year
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆40Updated 2 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆36Updated last week
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning & 【IJCV 2025】Diffusion-Enhanced Test-time Adap…☆60Updated last month
- ☆21Updated last year
- ☆57Updated 6 months ago
- [CVPR 2024] Official implementation of "Adapters Strike Back"☆35Updated 7 months ago
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024☆26Updated 6 months ago