MzeroMiko / mamba-mini
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivation. It is probably the code which is the most close to selective_scan_cuda in mamba.
☆81Updated last year
Alternatives and similar repositories for mamba-mini:
Users that are interested in mamba-mini are comparing it to the libraries listed below
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆64Updated 3 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆71Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆92Updated 9 months ago
- Open source implementation of "Vision Transformers Need Registers"☆166Updated 2 months ago
- Code Implementation of EfficientVMamba☆204Updated 11 months ago
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆33Updated 4 months ago
- Introduce Mamba2 to Vision.☆123Updated 7 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆437Updated last month
- ☆63Updated last month
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆202Updated 11 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆239Updated 10 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆84Updated last week
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 7 months ago
- ☆84Updated 2 years ago
- Official repository of MLLA (NeurIPS 2024)☆305Updated 4 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆163Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆216Updated 10 months ago
- Awesome list of papers that extend Mamba to various applications.☆132Updated 3 months ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆296Updated 3 months ago
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆59Updated 9 months ago
- A repository for DenseSSMs☆87Updated 11 months ago
- Official repository of Slide-Transformer (CVPR2023)☆167Updated 7 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆219Updated 7 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆90Updated last year
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆56Updated last month
- Official repository of InLine attention (NeurIPS 2024)☆44Updated 3 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆194Updated 6 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆101Updated 7 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆100Updated 3 weeks ago