The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
☆725Nov 25, 2024Updated last year
Alternatives and similar repositories for kan-gpt
Users that are interested in kan-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,602Aug 1, 2024Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆122May 25, 2024Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆403May 13, 2024Updated last year
- Kolmogorov Arnold Networks☆16,218Jan 19, 2025Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,201Dec 14, 2025Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Variations of Kolmogorov-Arnold Networks☆116May 15, 2024Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations.☆138Aug 26, 2024Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆108Oct 4, 2025Updated 5 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆472Jun 20, 2024Updated last year
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆297Apr 9, 2025Updated 11 months ago
- KAN for Vision Transformer☆253Oct 7, 2024Updated last year
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆919Apr 8, 2025Updated 11 months ago
- ☆749May 24, 2024Updated last year
- TKAN: Temporal Kolmogorov-Arnold Networks☆227Dec 16, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Gemma 2B with 10M context length using Infini-attention.☆937May 12, 2024Updated last year
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem…☆529Nov 19, 2024Updated last year
- [ICLR2025] Kolmogorov-Arnold Transformer☆855Mar 23, 2025Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆189Nov 24, 2024Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆989Jul 23, 2024Updated last year
- Cloudflare Container Rust Example☆17Jun 25, 2025Updated 9 months ago
- PyTorch native post-training library☆5,713Updated this week
- Testing KAN-based text generation GPT models☆18May 6, 2024Updated last year
- ☆13Nov 19, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆140May 8, 2024Updated last year
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Jun 24, 2024Updated last year
- Tools for merging pretrained large language models.☆6,895Mar 15, 2026Updated 2 weeks ago
- Temporal Kolmogorov-Arnold Transformer☆87Dec 27, 2024Updated last year
- A simple feature-based time series classifier using Kolmogorov–Arnold Networks☆123Aug 17, 2024Updated last year
- Mamba SSM architecture☆17,725Updated this week
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆375Apr 23, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,405Nov 29, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Transformer model based on Kolmogorov–Arnold Network(KAN), which is an alternative of Multi-Layer Perceptron(MLP)☆29Mar 16, 2026Updated 2 weeks ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,399Jul 1, 2024Updated last year
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines.☆40May 9, 2024Updated last year
- Understanding Kolmogorov-Arnold Networks: A Tutorial Series on KAN using Toy Examples☆202May 26, 2025Updated 10 months ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆36May 8, 2024Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks☆63Jun 1, 2025Updated 9 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,331May 4, 2024Updated last year