NX-AI / xlstmLinks
Official repository of the xLSTM.
☆1,902Updated 3 weeks ago
Alternatives and similar repositories for xlstm
Users that are interested in xlstm are comparing it to the libraries listed below
Sorting:
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆292Updated 11 months ago
- Resources about xLSTM by Sepp Hochreiter☆315Updated 7 months ago
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆167Updated 10 months ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,261Updated 6 months ago
- Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group☆498Updated last week
- Structured state space sequence models☆2,661Updated 11 months ago
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆876Updated 2 months ago
- ☆738Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,820Updated last year
- Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆1,212Updated 11 months ago
- xLSTMTime for time series forecasting☆164Updated 7 months ago
- Build high-performance AI models with modular building blocks☆528Updated 2 weeks ago
- Unified Training of Universal Time Series Forecasting Transformers☆1,168Updated 2 months ago
- Implementation of the proposed minGRU in Pytorch☆299Updated 3 months ago
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling☆718Updated 7 months ago
- [ICLR2025] Kolmogorov-Arnold Transformer☆787Updated 3 months ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆380Updated last year
- Schedule-Free Optimization in PyTorch☆2,180Updated last month
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆130Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆883Updated last month
- Code for "Is Mamba Effective for Time Series Forecasting?"☆295Updated last month
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆418Updated last year
- [ICLR 2025 Spotlight] Official implementation of "Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts"☆660Updated last week
- Code release for DynamicTanh (DyT)☆954Updated 2 months ago
- An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://ar…☆2,010Updated 10 months ago
- Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https:…☆1,669Updated last month
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,388Updated 10 months ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆425Updated 6 months ago
- MOMENT: A Family of Open Time-series Foundation Models, ICML'24☆549Updated last month
- TKAN: Temporal Kolmogorov-Arnold Networks☆208Updated 6 months ago