ag1988 / dlr
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonathan Berant).
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for dlr
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆13Updated 2 weeks ago
- Implementations of various linear RNN layers using pytorch and triton☆46Updated last year
- ☆42Updated 6 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆120Updated last year
- ☆31Updated 10 months ago
- A State-Space Model with Rational Transfer Function Representation.☆70Updated 6 months ago
- PyTorch-based library for various kinds of representational-similarity analysis☆22Updated 5 months ago
- Sequence Modeling with Structured State Spaces☆60Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆62Updated 2 years ago
- ☆18Updated last year
- Layerwise Batch Entropy Regularization☆22Updated 2 years ago
- Blog post☆16Updated 9 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆47Updated 2 weeks ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated last year
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆69Updated 2 years ago
- ☆46Updated last month
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆61Updated 6 months ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated last year
- ☆25Updated 4 months ago
- ☆62Updated 3 months ago
- ☆23Updated 8 months ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Updated last month
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆95Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Updated 5 months ago
- Efficient PScan implementation in PyTorch☆15Updated 10 months ago
- Relative Positional Encoding for Transformers with Linear Complexity☆61Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".☆30Updated 2 weeks ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆52Updated last month
- RWKV model implementation☆38Updated last year