OliverRensu / ARM
This repository is the official implementation of our paper "Autoregressive Pretraining with Mamba in Vision"
☆64 Updated 5 months ago
Related projects
Alternatives and complementary repositories for ARM
- ☆48 Updated 5 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model ☆83 Updated 5 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆68 Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆70 Updated 3 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆78 Updated 8 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated 6 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling ☆88 Updated 10 months ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT … ☆31 Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 Updated 6 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆72 Updated 2 months ago
- [CVPR 2024] Official implementation of "Adapters Strike Back" ☆32 Updated 3 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆65 Updated 3 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆93 Updated 3 months ago
- ☆32 Updated last year
- ☆112 Updated 5 months ago
- ☆75 Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers". ☆27 Updated 6 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆64 Updated last month
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆33 Updated 5 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆47 Updated 3 months ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction" ☆66 Updated 11 months ago
- Official PyTorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023) ☆67 Updated 10 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆30 Updated 5 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆76 Updated 9 months ago
- ☆25 Updated 2 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆56 Updated last month
- Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling" ☆64 Updated last year
- [CVPR 2024] The official PyTorch implementation of "A General and Efficient Training for Transformer via Token Expansion". ☆42 Updated 6 months ago
- ☆21 Updated last year
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation ☆65 Updated 3 months ago