Hprairie/Bi-Mamba2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hprairie/Bi-Mamba2)

Hprairie / Bi-Mamba2

A Triton Kernel for incorporating Bi-Directionality in Mamba2

☆83

Alternatives and similar repositories for Bi-Mamba2

Users that are interested in Bi-Mamba2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

goombalab / hydra
View on GitHub
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆175Jan 30, 2025Updated last year
Benjamin-Walker / selective-ssms-and-linear-cdes
View on GitHub
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17Jan 7, 2025Updated last year
HazyResearch / prefix-linear-attention
View on GitHub
☆62Jul 9, 2024Updated 2 years ago
AvivBick / awesome-ssm-ml
View on GitHub
Reading list for research topics in state-space models
☆367May 18, 2026Updated 2 months ago
wangck20 / GlobalMamba
View on GitHub
☆27Oct 15, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
alxndrTL / othello_mamba
View on GitHub
Evaluating the Mamba architecture on the Othello game
☆49Apr 25, 2024Updated 2 years ago
proger / nanokitchen
View on GitHub
Parallel Associative Scan for Language Models
☆18Jan 8, 2024Updated 2 years ago
NicolasZucchet / minimal-LRU
View on GitHub
Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)
☆63Sep 3, 2025Updated 10 months ago
HazyResearch / based
View on GitHub
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
☆256Jun 6, 2025Updated last year
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
ag1988 / dlr
View on GitHub
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…
☆23Dec 30, 2022Updated 3 years ago
goombalab / phi-mamba
View on GitHub
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…
☆125Sep 13, 2024Updated last year
Zyphra / Zamba2
View on GitHub
PyTorch implementation of models from the Zamba2 series.
☆193Jan 23, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
YuHengsss / VSSD
View on GitHub
[ICCV2025] Introduce Mamba2 to Vision.
☆190Oct 29, 2025Updated 8 months ago
jzhang38 / LongMamba
View on GitHub
Some preliminary explorations of Mamba's context scaling.
☆221Feb 8, 2024Updated 2 years ago
Doraemonzzz / Awesome-Triton-Resources
View on GitHub
Awesome Triton Resources
☆43Apr 27, 2025Updated last year
ighodgao / mamba-speech-synthesis
View on GitHub
Jupyter Notebook running Mamba speech synthesis example on Determined AI. Based on https://2084.substack.com/p/2084-marcrandbot-speech-sy…
☆23Feb 8, 2024Updated 2 years ago
jnypark / VideoMamba
View on GitHub
☆27Jun 4, 2024Updated 2 years ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
TRI-ML / linear_open_lm
View on GitHub
A repository for research on medium sized language models.
☆78May 23, 2024Updated 2 years ago
renll / SeqBoat
View on GitHub
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆40Dec 2, 2023Updated 2 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
srush / mamba-scans
View on GitHub
Blog post
☆17Feb 16, 2024Updated 2 years ago
BlinkDL / LinearAttentionArena
View on GitHub
Here we will test various linear attention designs.
☆62Apr 25, 2024Updated 2 years ago
JeongHun0716 / e-mvsr
View on GitHub
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)
☆20Mar 17, 2025Updated last year
FarnoushRJ / MambaLRP
View on GitHub
[NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍
☆48Nov 6, 2024Updated last year
AmeenAli / HiddenMambaAttn
View on GitHub
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆234Oct 16, 2025Updated 9 months ago
assafbk / DeciMamba
View on GitHub
DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)
☆32Apr 9, 2025Updated last year
emalach / LinearLM
View on GitHub
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21Jul 29, 2024Updated last year
TiledTensor / TiledBench
View on GitHub
Benchmark tests supporting the TiledCUDA library.
☆19Nov 19, 2024Updated last year
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
00ffcc / chunkRWKV6
View on GitHub
continous batching and parallel acceleration for RWKV6
☆23Jun 28, 2024Updated 2 years ago
HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
molML / s4-for-de-novo-drug-design
View on GitHub
The official codebase of the paper "Chemical language modeling with structured state space sequence models"
☆90Aug 1, 2024Updated last year
Eliyas0007 / Pytorch-Intention
View on GitHub
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12May 24, 2023Updated 3 years ago
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
usc-sail / trust-ser
View on GitHub
Trustworthy Speech Emotion Recognition
☆13May 22, 2023Updated 3 years ago
MzeroMiko / mamba-mini
View on GitHub
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…
☆109Oct 14, 2025Updated 9 months ago