WailordHe / DenseSSMLinks
A repository for DenseSSMs
☆87Updated last year
Alternatives and similar repositories for DenseSSM
Users that are interested in DenseSSM are comparing it to the libraries listed below
Sorting:
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 7 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated last month
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆60Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 9 months ago
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆28Updated last month
- ☆14Updated 8 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆222Updated last year
- ☆48Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆104Updated 2 weeks ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆135Updated 4 months ago
- Triton implement of bi-directional (non-causal) linear attention☆48Updated 4 months ago