MzeroMiko / mamba-mini
An efficient PyTorch implementation of selective scan in one file that works on both CPU and GPU, with the corresponding mathematical derivation. It is probably the closest code to selective_scan_cuda in Mamba.
☆80 Updated last year
Alternatives and similar repositories for mamba-mini:
Users who are interested in mamba-mini are comparing it to the repositories listed below:
- A Triton Kernel for incorporating Bi-Directionality in Mamba2 ☆63 Updated 3 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆70 Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model ☆91 Updated 9 months ago
- Introduce Mamba2 to Vision. ☆122 Updated 6 months ago
- ☆61 Updated 3 weeks ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆236 Updated 10 months ago
- Code Implementation of EfficientVMamba ☆202 Updated 11 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆168 Updated last month
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆74 Updated 7 months ago
- Awesome list of papers that extend Mamba to various applications. ☆132 Updated 3 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures ☆425 Updated last month
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete… ☆100 Updated 6 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆95 Updated 2 weeks ago
- A curated list of papers on the applications of RWKV in computer vision. ☆160 Updated last month
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio… ☆219 Updated 6 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆191 Updated 5 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025] ☆73 Updated last month
- Official repository of InLine attention (NeurIPS 2024) ☆44 Updated 3 months ago
- Official repository of MLLA (NeurIPS 2024) ☆291 Updated 3 months ago
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities ☆99 Updated last year
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications ☆104 Updated 5 months ago
- Official repository of Slide-Transformer (CVPR2023) ☆167 Updated 6 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆102 Updated 3 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆199 Updated 11 months ago
- A repository for DenseSSMs ☆87 Updated 11 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models" ☆216 Updated 9 months ago
- ☆63 Updated 2 years ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers" ☆125 Updated last month
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion ☆113 Updated 2 weeks ago