MzeroMiko / mamba-mini
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivation. It is probably the code which is the most close to selective_scan_cuda in mamba.
☆78Updated 10 months ago
Alternatives and similar repositories for mamba-mini:
Users that are interested in mamba-mini are comparing it to the libraries listed below
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 7 months ago
- Code Implementation of EfficientVMamba☆191Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 7 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆222Updated 8 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆60Updated 3 weeks ago
- ☆55Updated 6 months ago
- Open source implementation of "Vision Transformers Need Registers"☆162Updated 2 months ago
- Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆395Updated 2 months ago
- Awesome list of papers that extend Mamba to various applications.☆129Updated 3 weeks ago
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated 10 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆192Updated 9 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆30Updated 2 months ago
- Introduce Mamba2 to Vision.☆111Updated 4 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆71Updated 5 months ago
- Official repository of MLLA (NeurIPS 2024)☆267Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆209Updated 7 months ago
- Causal depthwise conv1d in CUDA, with a PyTorch interface☆378Updated last month
- ☆77Updated last year
- A curated list of papers on the applications of RWKV in computer vision.☆140Updated this week
- Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"☆64Updated 5 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆118Updated 5 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆215Updated 4 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆86Updated 4 months ago
- Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆65Updated 2 months ago
- A repository for DenseSSMs☆87Updated 9 months ago
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆55Updated 6 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆99Updated last month
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆79Updated 9 months ago
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆93Updated 3 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆88Updated last year