NX-AI / xlstm
Official repository of the xLSTM.
☆1,777Updated last week
Alternatives and similar repositories for xlstm:
Users that are interested in xlstm are comparing it to the libraries listed below
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆285Updated 8 months ago
- Resources about xLSTM by Sepp Hochreiter☆309Updated 4 months ago
- xLSTM as Generic Vision Backbone☆465Updated 4 months ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,171Updated 3 months ago
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆850Updated 4 months ago
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆159Updated 7 months ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,753Updated last year
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆393Updated 9 months ago
- Schedule-Free Optimization in PyTorch☆2,116Updated 3 weeks ago
- Structured state space sequence models☆2,581Updated 8 months ago
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆2,826Updated last month
- Implementation of the proposed minGRU in Pytorch☆283Updated last week
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,265Updated 7 months ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆368Updated 10 months ago
- ☆725Updated 10 months ago
- [ICLR2025] Kolmogorov-Arnold Transformer☆730Updated last month
- Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆1,144Updated 8 months ago
- Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports☆86Updated 9 months ago
- Code for "Is Mamba Effective for Time Series Forecasting?"☆260Updated 2 months ago
- Code release for DynamicTanh (DyT)☆710Updated last week
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆130Updated 10 months ago
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem…☆477Updated 4 months ago
- Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group☆484Updated 2 months ago
- Unified Training of Universal Time Series Forecasting Transformers☆1,057Updated last month
- TKAN: Temporal Kolmogorov-Arnold Networks☆193Updated 3 months ago
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"☆1,177Updated last year
- xLSTMTime for time series forecasting☆147Updated 4 months ago
- Annotated version of the Mamba paper☆475Updated last year
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆419Updated 3 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆178Updated 4 months ago