Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
☆183Jan 7, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-notes
Users that are interested in mamba-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆39Apr 5, 2024Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆133Oct 18, 2024Updated last year
- Notes on Direct Preference Optimization☆26Apr 14, 2024Updated 2 years ago
- Some preliminary explorations of Mamba's context scaling.☆219Feb 8, 2024Updated 2 years ago
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Awesome Papers related to Mamba.☆1,397Oct 17, 2024Updated last year
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,460May 3, 2026Updated 3 weeks ago
- Mamba SSM architecture☆18,275May 10, 2026Updated 2 weeks ago
- ☆52Jan 28, 2024Updated 2 years ago
- Collection of papers on state-space models☆621Nov 4, 2025Updated 6 months ago
- [IEEE TMM] T-Mamba: A unified framework with Long-Range Dependency in dual-domain for 2D & 3D Tooth Segmentation☆100Apr 1, 2026Updated last month
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆107Oct 14, 2025Updated 7 months ago
- A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.☆728Apr 19, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,946Mar 8, 2024Updated 2 years ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆67Jul 1, 2024Updated last year
- ☆67Oct 22, 2024Updated last year
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,869Feb 13, 2025Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆232Oct 16, 2025Updated 7 months ago
- Reading list for research topics in state-space models☆363Apr 14, 2026Updated last month
- ☆10Aug 9, 2025Updated 9 months ago
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications☆752Jun 28, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Nov 28, 2023Updated 2 years ago
- ☆107Mar 9, 2024Updated 2 years ago
- ☆16Oct 20, 2025Updated 7 months ago
- ☆31Jul 2, 2023Updated 2 years ago
- blog☆17Oct 2, 2024Updated last year
- Evaluating the Mamba architecture on the Othello game☆49Apr 25, 2024Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆98Apr 10, 2024Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆15Apr 5, 2024Updated 2 years ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆360May 28, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An annotated implementation of the Hyena Hierarchy paper☆34May 28, 2023Updated 2 years ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- ☆37Feb 26, 2024Updated 2 years ago
- VMamba: Visual State Space Models,code is based on mamba☆3,150Mar 7, 2025Updated last year
- code for our MSCA-Net☆18Mar 27, 2024Updated 2 years ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆956Nov 16, 2025Updated 6 months ago
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆28Nov 17, 2025Updated 6 months ago