Adamdad / kat
[ICLR2025] Kolmogorov-Arnold Transformer
☆844 Updated 9 months ago
Alternatives and similar repositories for kat
Users who are interested in kat compare it to the repositories listed below
- KAN for Vision Transformer ☆255 Updated last year
- MoH: Multi-Head Attention as Mixture-of-Head Attention ☆300 Updated last year
- ☆78 Updated 11 months ago
- Code release for DynamicTanh (DyT) ☆1,031 Updated 9 months ago
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem… ☆530 Updated last year
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th… ☆911 Updated 8 months ago
- ☆140 Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines. ☆400 Updated last year
- xLSTM as Generic Vision Backbone ☆492 Updated 2 months ago
- Benchmarking and Testing FastKAN ☆89 Updated last year
- ☆748 Updated last year
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN) ☆460 Updated last year
- A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba. ☆686 Updated 4 months ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces) ☆177 Updated last year
- A More Fair and Comprehensive Comparison between KAN and MLP ☆177 Updated last year
- ☆253 Updated 2 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆186 Updated last year
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications ☆747 Updated 6 months ago
- Minimal Mamba-2 implementation in PyTorch ☆239 Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations. ☆137 Updated last year
- Simba ☆215 Updated last year
- When it comes to optimizers, it's always better to be safe than sorry ☆399 Updated 3 months ago
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink… ☆754 Updated 2 weeks ago
- Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Mod… ☆480 Updated last week
- A simple and efficient Mamba implementation in pure PyTorch and MLX. ☆1,396 Updated last year
- 🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥 ☆530 Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models" ☆232 Updated 2 months ago
- Causal depthwise conv1d in CUDA, with a PyTorch interface ☆688 Updated last week
- Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis ☆260 Updated 5 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling ☆211 Updated 2 months ago