MzeroMiko / mamba-mini
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivation. It is probably the code which is the most close to selective_scan_cuda in mamba.
☆81Updated last year
Alternatives and similar repositories for mamba-mini:
Users that are interested in mamba-mini are comparing it to the libraries listed below
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆92Updated 9 months ago
- ☆62Updated last month
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆432Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆71Updated 9 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆82Updated last week
- Code Implementation of EfficientVMamba☆203Updated 11 months ago
- Open source implementation of "Vision Transformers Need Registers"☆166Updated 2 months ago
- Introduce Mamba2 to Vision.☆123Updated 7 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆64Updated 3 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆237Updated 10 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 7 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆106Updated 3 months ago
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year
- Awesome list of papers that extend Mamba to various applications.☆132Updated 3 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆33Updated 4 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆100Updated 7 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆219Updated 7 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆100Updated 3 weeks ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆186Updated 9 months ago
- A repository for DenseSSMs☆87Updated 11 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆193Updated 6 months ago
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆105Updated 5 months ago
- ☆84Updated last year
- Official repository of InLine attention (NeurIPS 2024)☆44Updated 3 months ago
- Causal depthwise conv1d in CUDA, with a PyTorch interface☆422Updated 3 months ago
- Official repository of MLLA (NeurIPS 2024)☆300Updated 4 months ago
- ☆35Updated 8 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆202Updated 11 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆216Updated 10 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆163Updated last month