IBM / selective-dense-state-space-modelLinks
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages"
☆15Updated 2 months ago
Alternatives and similar repositories for selective-dense-state-space-model
Users that are interested in selective-dense-state-space-model are comparing it to the libraries listed below
Sorting:
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Updated last year
- Xmixers: A collection of SOTA efficient token/channel mixers☆29Updated 2 months ago
- ☆35Updated last year
- Flash-Linear-Attention models beyond language☆20Updated 2 months ago
- ☆27Updated last month
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- Stick-breaking attention☆61Updated 4 months ago
- ☆57Updated last year
- Here we will test various linear attention designs.☆61Updated last year
- ☆49Updated last year
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink…