PeaBrane / mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆112Updated 4 months ago
Alternatives and similar repositories for mamba-tiny:
Users that are interested in mamba-tiny are comparing it to the libraries listed below
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆123Updated last month
- Annotated version of the Mamba paper☆474Updated last year
- Reading list for research topics in state-space models☆263Updated last month
- ☆284Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆214Updated 9 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆158Updated last month
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆273Updated 3 months ago
- Implementation of the proposed minGRU in Pytorch☆281Updated 2 weeks ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆114Updated 9 months ago
- ☆161Updated 3 months ago
- Normalized Transformer (nGPT)☆156Updated 3 months ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆364Updated 9 months ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆77Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆101Updated 3 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆61Updated 2 months ago
- Some preliminary explorations of Mamba's context scaling.☆213Updated last year
- Benchmarking and Testing FastKAN☆71Updated 9 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆187Updated last month
- Evaluating the Mamba architecture on the Othello game☆44Updated 10 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆93Updated 6 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 7 months ago
- Awesome list of papers that extend Mamba to various applications.☆131Updated 2 months ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆161Updated last year
- Accelerated First Order Parallel Associative Scan☆172Updated 6 months ago
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆137Updated last week
- A State-Space Model with Rational Transfer Function Representation.☆77Updated 9 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆51Updated 10 months ago
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆74Updated 10 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".☆37Updated 3 months ago
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆44Updated last year