The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
☆725Nov 25, 2024Updated last year
Alternatives and similar repositories for kan-gpt
Users that are interested in kan-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,605Aug 1, 2024Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆122May 25, 2024Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆404May 13, 2024Updated last year
- Kolmogorov Arnold Networks☆16,215Jan 19, 2025Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,206Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Variations of Kolmogorov-Arnold Networks☆116May 15, 2024Updated last year
- Benchmark for efficiency in memory and time of different KAN implementations.☆139Aug 26, 2024Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆108Oct 4, 2025Updated 6 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆472Jun 20, 2024Updated last year
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆297Apr 9, 2025Updated 11 months ago
- KAN for Vision Transformer☆253Oct 7, 2024Updated last year
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆919Apr 8, 2025Updated last year
- ☆749May 24, 2024Updated last year
- TKAN: Temporal Kolmogorov-Arnold Networks☆227Dec 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Gemma 2B with 10M context length using Infini-attention.☆937May 12, 2024Updated last year
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem…☆529Nov 19, 2024Updated last year
- [ICLR2025] Kolmogorov-Arnold Transformer☆856Mar 23, 2025Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆189Nov 24, 2024Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- Cloudflare Container Rust Example☆17Jun 25, 2025Updated 9 months ago
- PyTorch native post-training library☆5,720Updated this week
- Testing KAN-based text generation GPT models☆18May 6, 2024Updated last year
- ☆13Nov 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆140May 8, 2024Updated last year
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Jun 24, 2024Updated last year
- Tools for merging pretrained large language models.☆6,945Mar 15, 2026Updated 3 weeks ago
- Temporal Kolmogorov-Arnold Transformer☆88Dec 27, 2024Updated last year
- A simple feature-based time series classifier using Kolmogorov–Arnold Networks☆123Aug 17, 2024Updated last year
- Mamba SSM architecture☆17,834Mar 30, 2026Updated last week
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆375Apr 23, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,414Nov 29, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Transformer model based on Kolmogorov–Arnold Network(KAN), which is an alternative of Multi-Layer Perceptron(MLP)☆29Mar 16, 2026Updated 3 weeks ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,409Jul 1, 2024Updated last year
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines.☆40May 9, 2024Updated last year
- Understanding Kolmogorov-Arnold Networks: A Tutorial Series on KAN using Toy Examples☆202May 26, 2025Updated 10 months ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆37May 8, 2024Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks☆63Jun 1, 2025Updated 10 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,329May 4, 2024Updated last year