siyuan0 / gp-kan
☆9 · Updated 11 months ago
Alternatives and similar repositories for gp-kan
Users who are interested in gp-kan are comparing it to the libraries listed below
- A More Fair and Comprehensive Comparison between KAN and MLP ☆169 · Updated 10 months ago
- Benchmarking and Testing FastKAN ☆78 · Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations. ☆126 · Updated 10 months ago
- C++ and Cuda ops for fused FourierKAN ☆79 · Updated last year
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks… ☆62 · Updated this week
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch ☆88 · Updated 4 months ago
- ☆26 · Updated 6 months ago
- Triton implement of bi-directional (non-causal) linear attention ☆50 · Updated 4 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆67 · Updated 2 months ago
- ☆13 · Updated 8 months ago
- 😎 A curated list of tensor decomposition resources for model compression. ☆69 · Updated this week
- ☆18 · Updated 8 months ago
- Clifford-Steerable Convolutional Neural Networks [ICML'24] ☆47 · Updated last month
- Flash-Muon: An Efficient Implementation of Muon Optimizer ☆131 · Updated last week
- Unofficial Implementation of Selective Attention Transformer ☆17 · Updated 7 months ago
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆91 · Updated 2 weeks ago
- Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems [ICML'25] ☆66 · Updated this week
- ☆16 · Updated last year
- ☆92 · Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch ☆64 · Updated 3 weeks ago
- ☆63 · Updated 4 months ago
- Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch ☆63 · Updated 2 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆183 · Updated 7 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆55 · Updated 2 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models ☆31 · Updated last year
- ☆40 · Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025) ☆28 · Updated 2 months ago
- A collection of tricks and tools to speed up transformer models ☆167 · Updated 3 weeks ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence). ☆119 · Updated 8 months ago
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation) ☆105 · Updated 7 months ago