myscience / retnet-pytorch
Implementation of Retention-Network in PyTorch
☆12 Updated last year
Related projects
Alternatives and complementary repositories for retnet-pytorch
- Implementation of xLSTM in PyTorch from the paper: "xLSTM: Extended Long Short-Term Memory" ☆105 Updated last week
- State Space Models ☆63 Updated 6 months ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in PyTorch and Ze… ☆84 Updated last week
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o… ☆41 Updated 11 months ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets, etc. ☆31 Updated 6 months ago
- Combine B-Splines (BS) and Radial Basis Functions (RBF) in Kolmogorov-Arnold Networks (KANs) ☆22 Updated this week
- Implementation of MambaFormer in PyTorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆21 Updated last week
- A PyTorch implementation of Fourier Analysis Networks (FAN) ☆11 Updated last month
- CUDA implementation of Extended Long Short-Term Memory (xLSTM) with C++ and PyTorch ports ☆75 Updated 5 months ago
- This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation. ☆25 Updated 5 months ago
- A modified CNN architecture using Kolmogorov-Arnold Networks ☆65 Updated 5 months ago
- PyTorch implementation of the xLSTM model by Beck et al. (2024) ☆140 Updated 3 months ago
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines. ☆32 Updated 6 months ago
- MNIST example using Kolmogorov-Arnold Networks ☆27 Updated 6 months ago
- C++ and CUDA ops for fused FourierKAN ☆73 Updated 6 months ago
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks ☆76 Updated 3 months ago
- PyTorch (Lightning) implementation of the Mamba model ☆14 Updated 6 months ago
- Several types of attention modules written in PyTorch for learning purposes ☆40 Updated last month
- My attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture ☆130 Updated 6 months ago
- An implementation of mLSTM and sLSTM in PyTorch. ☆25 Updated 5 months ago
- Transformer model based on Kolmogorov–Arnold Network (KAN), an alternative to the Multi-Layer Perceptron (MLP) ☆25 Updated this week
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆50 Updated last week
- [ICLR 2024] Official code of RobustTSF ☆15 Updated 9 months ago
- tinybig for deep function learning ☆36 Updated this week