andrewgcodes / xlstmLinks

my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture

☆131

Alternatives and similar repositories for xlstm

Users that are interested in xlstm are comparing it to the libraries listed below

Sorting:

kyegomez / xLSTM
Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"
☆119Updated 2 weeks ago
myscience / x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
☆169Updated 11 months ago
AI-Guru / xlstm-resources
Resources about xLSTM by Sepp Hochreiter
☆318Updated 8 months ago
kyegomez / MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆200Updated 2 weeks ago
muditbhargava66 / PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
☆294Updated last year
smvorwerk / xlstm-cuda
Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports
☆88Updated last year
sidhu2690 / Deep-KAN
This repository contains a better implementation of Kolmogorov-Arnold networks
☆63Updated 2 months ago
Indoxer / LKAN
Variations of Kolmogorov-Arnold Networks
☆115Updated last year
sidhu2690 / RBF-KAN
This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation.
☆29Updated last year
akaashdash / xlstm
☆51Updated last year
CG80499 / KAN-GPT-2
Training small GPT-2 style models using Kolmogorov-Arnold networks.
☆121Updated last year
jakariaemon / CNN-KAN
A modified CNN architecture using Kolmogorov-Arnold Networks
☆83Updated last year
mlsquare / xKAN
Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc
☆35Updated last year
quiqi / relu_kan
☆94Updated last year
SynodicMonth / ChebyKAN
Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.
☆383Updated last year
Zhangyanbo / MLP-KAN
Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)
☆106Updated 9 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
akaashdash / kansformers
☆136Updated last year
muslehal / xLSTMTime
xLSTMTime for time series forecasting
☆168Updated 8 months ago
MSD-IRIMAS / Simple-KAN-4-Time-Series
A simple feature-based time series classifier using Kolmogorov–Arnold Networks
☆119Updated 11 months ago
kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆179Updated 4 months ago
myscience / mamba
Pytorch (Lightning) implementation of the Mamba model
☆29Updated 3 months ago
lich99 / TiDE
Unofficial Implementation of Long-term Forecasting with TiDE: Time-series Dense Encoder
☆53Updated 2 years ago
remigenet / TKAN
TKAN: Temporal Kolmogorov-Arnold Networks
☆211Updated 7 months ago
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆56Updated last year
HazyResearch / spacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.
☆175Updated 2 years ago
AI-Guru / helibrunna
A HuggingFace compatible Small Language Model trainer.
☆75Updated 6 months ago
StarostinV / convkan
Convolutional layer for Kolmogorov-Arnold Network (KAN)
☆105Updated 4 months ago
team-daniel / KAN
Implementation on how to use Kolmogorov-Arnold Networks (KANs) for classification and regression tasks.
☆254Updated 11 months ago
YihongDong / FAN
☆243Updated 5 months ago