AdityaNG / kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
☆715 · Updated 4 months ago
Alternatives and similar repositories for kan-gpt:
Users who are interested in kan-gpt are comparing it to the libraries listed below.
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines. ☆370 · Updated 11 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN) ☆403 · Updated 9 months ago
- ☆728 · Updated 10 months ago
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th… ☆853 · Updated last week
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆177 · Updated 4 months ago
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor… ☆2,880 · Updated last month
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). ☆4,302 · Updated 8 months ago
- Variations of Kolmogorov-Arnold Networks ☆114 · Updated 11 months ago
- KAN for Vision Transformer ☆246 · Updated 6 months ago
- Build high-performance AI models with modular building blocks ☆497 · Updated this week
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆116 · Updated 10 months ago
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem… ☆491 · Updated 5 months ago
- Understanding Kolmogorov-Arnold Networks: A Tutorial Series on KAN using Toy Examples ☆185 · Updated 6 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆636 · Updated last month
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models" ☆1,182 · Updated last year
- [ICLR2025] Kolmogorov-Arnold Transformer ☆754 · Updated 3 weeks ago
- Schedule-Free Optimization in PyTorch ☆2,135 · Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆862 · Updated 2 months ago
- Implementation on how to use Kolmogorov-Arnold Networks (KANs) for classification and regression tasks. ☆245 · Updated 7 months ago
- Official repository of the xLSTM. ☆1,819 · Updated last week
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model" ☆166 · Updated 2 weeks ago
- Implementation of the proposed minGRU in Pytorch ☆285 · Updated last month
- Annotated version of the Mamba paper ☆481 · Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations. ☆121 · Updated 7 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆982 · Updated 8 months ago
- Resources about xLSTM by Sepp Hochreiter ☆311 · Updated 5 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture" ☆549 · Updated 3 months ago
- TKAN: Temporal Kolmogorov-Arnold Networks ☆200 · Updated 4 months ago
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs. ☆381 · Updated 10 months ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍 ☆922 · Updated last year