i404788 / s5-pytorchLinks
Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
☆74Updated last year
Alternatives and similar repositories for s5-pytorch
Users that are interested in s5-pytorch are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆82Updated last year
- Implementations of various linear RNN layers using pytorch and triton☆51Updated last year
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch☆69Updated last month
- ☆290Updated 4 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆54Updated 7 months ago
- Parallelizing non-linear sequential models over the sequence length☆51Updated 4 months ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆64Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated 2 months ago
- A State-Space Model with Rational Transfer Function Representation.☆78Updated last year
- Sequence Modeling with Structured State Spaces☆64Updated 2 years ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆68Updated 5 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆137Updated 4 months ago
- ☆26Updated 10 months ago
- ☆129Updated last year
- ☆178Updated last year
- ☆29Updated 6 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆101Updated last year
- Implementation of the proposed minGRU in Pytorch☆296Updated 2 months ago
- A PyTorch implementation of Legendre Memory Units (LMUs) and its FFT variant☆42Updated 3 years ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆89Updated 2 weeks ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆50Updated 6 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆88Updated 11 months ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆94Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆28Updated last month
- MoMo: Momentum Models for Adaptive Learning Rates☆19Updated 11 months ago
- ☆40Updated last year
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)☆62Updated 2 months ago
- ☆23Updated 8 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆121Updated last month