Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,948Mar 8, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-minimal
Users that are interested in mamba-minimal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mamba SSM architecture☆18,275May 10, 2026Updated 2 weeks ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,460May 3, 2026Updated 3 weeks ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,869Feb 13, 2025Updated last year
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- Structured state space sequence models☆2,899Jul 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆940Mar 3, 2024Updated 2 years ago
- VMamba: Visual State Space Models,code is based on mamba☆3,160Mar 7, 2025Updated last year
- Awesome Papers related to Mamba.☆1,397Oct 17, 2024Updated last year
- High-speed Large Language Model Serving for Local Deployment☆9,469May 11, 2026Updated 2 weeks ago
- 🚀 Efficient implementations for emerging model architectures☆5,116May 17, 2026Updated last week
- Collection of papers on state-space models☆621Nov 4, 2025Updated 6 months ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,536Updated this week
- Fast and memory-efficient exact attention☆23,917Updated this week
- ☆36Nov 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Minimal Mamba-2 implementation in PyTorch☆252Jun 17, 2024Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆254Jun 6, 2025Updated 11 months ago
- Some preliminary explorations of Mamba's context scaling.☆219Feb 8, 2024Updated 2 years ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"