Klassikcat / KANElectra
Transformer model based on the Kolmogorov–Arnold Network (KAN), an alternative to the Multi-Layer Perceptron (MLP)
☆28Updated 2 weeks ago
Alternatives and similar repositories for KANElectra
Users who are interested in KANElectra are comparing it to the libraries listed below
- An implementation of mLSTM and sLSTM in PyTorch.☆28Updated last year
- Fast Convolutional KAN☆61Updated last year
- ☆133Updated last year
- State Space Models☆67Updated last year
- Convolutional layer for Kolmogorov-Arnold Network (KAN)☆100Updated 3 months ago
- Implementation of xLSTM in PyTorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Updated 2 months ago
- ☆63Updated 4 months ago
- ☆47Updated last year
- A modified CNN architecture using Kolmogorov-Arnold Networks☆80Updated last year
- Simba☆208Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated 2 months ago
- Trainable Highly-expressive Activation Functions. ECCV 2024☆38Updated 4 months ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 6 months ago
- C++ and CUDA ops for fused FourierKAN☆79Updated last year
- Official implementation of paper Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series Forec…☆16Updated 2 months ago
- This repository contains the codes to replicate the simulations from the paper: "Wav-KAN: Wavelet Kolmogorov-Arnold Networks". It showca…☆147Updated 2 weeks ago
- A simple Bidirectional Mamba☆21Updated last year
- Semantics-Aware Patch Encoding and Hierarchical Dependency Modeling for Long-Term Time Series Forecasting☆45Updated 2 weeks ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 8 months ago
- [NeurIPS 2023] Lightweight Vision Transformer with Bidirectional Interaction☆24Updated last year
- CUDA implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports☆87Updated last year
- Replacing classic layers with KAN blocks for hyperspectral data☆34Updated 6 months ago
- MNIST example using Kolmogorov-Arnold Networks☆27Updated last year
- Drop-in convolutional Kolmogorov-Arnold Network replacement of Conv2d☆18Updated last year
- ☆43Updated 4 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆64Updated last month
- A repository for DenseSSMs☆87Updated last year
- A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related web…☆17Updated last year
- ☆23Updated 8 months ago
- Official Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆60Updated 11 months ago