yu-rp / KANbeFair
A More Fair and Comprehensive Comparison between KAN and MLP
☆150Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for KANbeFair
- Benchmark for efficiency in memory and time of different KAN implementations.☆111Updated 2 months ago
- Awesome list of papers that extend Mamba to various applications.☆128Updated 2 months ago
- ☆39Updated this week
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆103Updated 3 weeks ago
- C++ and Cuda ops for fused FourierKAN☆73Updated 6 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆137Updated last week
- ☆77Updated 5 months ago
- ☆37Updated 6 months ago
- Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel☆605Updated last month
- Benchmarking and Testing FastKAN☆65Updated 5 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆170Updated last week
- Implementation of a multimodal diffusion transformer in Pytorch☆97Updated 5 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆160Updated 3 months ago
- Reading list for research topics in state-space models☆242Updated 3 weeks ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆103Updated 3 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆367Updated 5 months ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆102Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆202Updated 5 months ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆349Updated 6 months ago
- ☆103Updated this week
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆50Updated last week
- State Space Models☆63Updated 6 months ago
- Implementation of Agent Attention in Pytorch☆86Updated 4 months ago
- ☆119Updated 6 months ago
- Implementation of Infini-Transformer in Pytorch☆104Updated last month
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆70Updated 4 months ago
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind☆112Updated 3 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆94Updated this week
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆108Updated 5 months ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆84Updated last week