AmeenAli / HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆211Updated 8 months ago
Alternatives and similar repositories for HiddenMambaAttn:
Users that are interested in HiddenMambaAttn are comparing it to the libraries listed below
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆123Updated 3 weeks ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆60Updated 2 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆157Updated 3 weeks ago
- Minimal Mamba-2 implementation in PyTorch☆172Updated 8 months ago
- Simba☆201Updated 10 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling