The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
☆726Nov 25, 2024Updated last year
Alternatives and similar repositories for kan-gpt
Users that are interested in kan-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,628Aug 1, 2024Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆124May 25, 2024Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆412May 13, 2024Updated last year
- Kolmogorov Arnold Networks☆16,258Jan 19, 2025Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,221Apr 2, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Variations of Kolmogorov-Arnold Networks☆116May 15, 2024Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations.☆140Aug 26, 2024Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆108Oct 4, 2025Updated 6 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆477Jun 20, 2024Updated last year
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆298Apr 9, 2025Updated last year
- KAN for Vision Transformer☆256Oct 7, 2024Updated last year
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆920Apr 8, 2025Updated last year
- ☆749May 24, 2024Updated last year
- TKAN: Temporal Kolmogorov-Arnold Networks☆228Dec 16, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Gemma 2B with 10M context length using Infini-attention.☆936May 12, 2024Updated last year
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem…☆530Nov 19, 2024Updated last year
- [ICLR2025] Kolmogorov-Arnold Transformer☆856Mar 23, 2025Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆190Nov 24, 2024Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- Cloudflare Container Rust Example☆17Jun 25, 2025Updated 10 months ago
- PyTorch native post-training library☆5,739Updated this week
- Testing KAN-based text generation GPT models☆19May 6, 2024Updated last year
- ☆13Nov 19, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆140May 8, 2024Updated last year
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆45Jun 24, 2024Updated last year
- Temporal Kolmogorov-Arnold Transformer☆88Dec 27, 2024Updated last year
- Tools for merging pretrained large language models.☆7,023Mar 15, 2026Updated last month
- A simple feature-based time series classifier using Kolmogorov–Arnold Networks☆123Aug 17, 2024Updated last year
- Mamba SSM architecture☆18,118Updated this week
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆376Apr 23, 2024Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,418Nov 29, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Transformer model based on Kolmogorov–Arnold Network(KAN), which is an alternative of Multi-Layer Perceptron(MLP)☆29Mar 16, 2026Updated last month
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,454Jul 1, 2024Updated last year
- Understanding Kolmogorov-Arnold Networks: A Tutorial Series on KAN using Toy Examples☆202May 26, 2025Updated 11 months ago
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines.☆40May 9, 2024Updated last year
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆37May 8, 2024Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks☆63Jun 1, 2025Updated 10 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,326May 4, 2024Updated last year