[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆90May 30, 2025Updated 9 months ago
Alternatives and similar repositories for ARM
Users that are interested in ARM are comparing it to the libraries listed below
Sorting:
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆99May 3, 2024Updated last year
- ☆81Feb 27, 2025Updated last year
- ☆59Jun 18, 2024Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆274May 6, 2024Updated last year
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆132Mar 22, 2025Updated 11 months ago
- ☆26Oct 15, 2024Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"☆26Feb 6, 2026Updated 3 weeks ago
- [NeurIPS 2024] Official repository of MLLA☆371Jul 11, 2025Updated 7 months ago
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.☆15Jun 4, 2024Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- [ICCV2025] Introduce Mamba2 to Vision.☆185Oct 29, 2025Updated 4 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆86Apr 6, 2025Updated 10 months ago
- ☆27Feb 27, 2025Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Mod…☆482Feb 10, 2026Updated 3 weeks ago
- [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Jun 17, 2024Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆86Mar 21, 2024Updated last year
- ☆25Oct 15, 2024Updated last year
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications☆745Jun 28, 2025Updated 8 months ago
- ☆71Nov 18, 2024Updated last year
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Feb 9, 2026Updated 3 weeks ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆143Jan 13, 2025Updated last year
- ☆11Oct 26, 2021Updated 4 years ago
- Implementation of the Pairformer model used in AlphaFold 3☆14Feb 23, 2026Updated last week
- [ICCV 2023] Robust Object Modeling for Visual Tracking, Official Implementation☆47Jan 5, 2025Updated last year
- ✨✨Latest Papers on Vision Mamba and Related Areas☆382Apr 17, 2025Updated 10 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Jan 31, 2026Updated last month
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,805Feb 13, 2025Updated last year
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆27Jun 12, 2024Updated last year
- VMamba: Visual State Space Models,code is based on mamba☆3,054Mar 7, 2025Updated 11 months ago
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆2,034Feb 9, 2026Updated 3 weeks ago
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆12Jan 13, 2025Updated last year
- TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the idea of BERT model…☆15Dec 27, 2018Updated 7 years ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 10 months ago
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year