tommyip / mamba2-minimalLinks

Minimal Mamba-2 implementation in PyTorch

☆236

Alternatives and similar repositories for mamba2-minimal

Users that are interested in mamba2-minimal are comparing it to the libraries listed below

Sorting:

Dao-AILab / causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
☆661Updated last month
nanowell / Differential-Transformer-PyTorch
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …
☆83Updated last year
Hprairie / Bi-Mamba2
A Triton Kernel for incorporating Bi-Directionality in Mamba2
☆75Updated 11 months ago
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆56Updated last month
AmeenAli / HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆231Updated last month
YihongDong / FAN
☆252Updated last month
XiudingCai / Awesome-Mamba-Collection
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
☆673Updated 3 months ago
vasqu / mamba2-torch
☆52Updated last year
MzeroMiko / mamba-mini
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…
☆98Updated last month
Event-AHU / Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
☆745Updated 5 months ago
radarFudan / Awesome-state-space-models
Collection of papers on state-space models
☆609Updated last month
kyegomez / MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆211Updated last month
badripatro / simba
Simba
☆215Updated last year
LeapLabTHU / MLLA
Official repository of MLLA (NeurIPS 2024)
☆362Updated 4 months ago
alxndrTL / mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
☆1,377Updated last year
kyegomez / SwitchTransformers
Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…
☆134Updated last month
kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆198Updated last month
akaashdash / kansformers
☆140Updated last year
pengzhangzhi / Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
☆139Updated 5 months ago
Adamdad / rational_kat_cu
☆76Updated 10 months ago
hkproj / mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
☆175Updated last year
goombalab / hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆166Updated 10 months ago
myscience / x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
☆179Updated last year
tensorgi / TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
☆431Updated last month
AvivBick / awesome-ssm-ml
Reading list for research topics in state-space models
☆335Updated 5 months ago
kyegomez / MoE-Mamba
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…
☆118Updated last month
chenziwenhaoshuai / Vision-KAN
KAN for Vision Transformer
☆253Updated last year
muditbhargava66 / PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
☆302Updated last year
NX-AI / vision-lstm
xLSTM as Generic Vision Backbone
☆490Updated last month
kyleliang919 / C-Optim
When it comes to optimizers, it's always better to be safe than sorry
☆389Updated 2 months ago