andrewgcodes / xlstmLinks
my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture
☆131Updated last year
Alternatives and similar repositories for xlstm
Users that are interested in xlstm are comparing it to the libraries listed below
Sorting:
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆118Updated this week
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆173Updated last year
- Resources about xLSTM by Sepp Hochreiter☆316Updated 10 months ago
- Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports☆89Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆206Updated 3 weeks ago
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆296Updated last year
- ☆55Updated last year
- ☆96Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆105Updated this week
- This repository contains a better implementation of Kolmogorov-Arnold networks