Hprairie / Bi-Mamba2Links
A Triton Kernel for incorporating Bi-Directionality in Mamba2
☆75Updated 8 months ago
Alternatives and similar repositories for Bi-Mamba2
Users that are interested in Bi-Mamba2 are comparing it to the libraries listed below
Sorting:
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆158Updated 7 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆226Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"