The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
☆724Nov 25, 2024Updated last year
Alternatives and similar repositories for kan-gpt
Users that are interested in kan-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,646Aug 1, 2024Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆122May 25, 2024Updated 2 years ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆413May 13, 2024Updated 2 years ago
- Kolmogorov Arnold Networks☆16,300Jan 19, 2025Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,244Jun 1, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Variations of Kolmogorov-Arnold Networks☆116May 15, 2024Updated 2 years ago
- Benchmark for efficiency in memory and time of different KAN implementations.☆140Aug 26, 2024Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆108Oct 4, 2025Updated 8 months ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆485Jun 20, 2024Updated last year
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆296Apr 9, 2025Updated last year
- KAN for Vision Transformer☆256Oct 7, 2024Updated last year
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing th…☆919Apr 8, 2025Updated last year
- ☆752May 24, 2024Updated 2 years ago
- TKAN: Temporal Kolmogorov-Arnold Networks☆225Dec 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Gemma 2B with 10M context length using Infini-attention.☆933May 12, 2024Updated 2 years ago
- This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implem…☆531Nov 19, 2024Updated last year
- [ICLR2025] Kolmogorov-Arnold Transformer☆849Mar 23, 2025Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆190Nov 24, 2024Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆986Jul 23, 2024Updated last year
- Cloudflare Container Rust Example☆17Jun 25, 2025Updated 11 months ago
- PyTorch native post-training library☆5,768Updated this week
- Testing KAN-based text generation GPT models☆19May 6, 2024Updated 2 years ago
- ☆13Nov 19, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆139May 8, 2024Updated 2 years ago
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Jun 24, 2024Updated last year
- Temporal Kolmogorov-Arnold Transformer☆88Dec 27, 2024Updated last year
- Tools for merging pretrained large language models.☆7,108May 6, 2026Updated last month
- A simple feature-based time series classifier using Kolmogorov–Arnold Networks☆121Aug 17, 2024Updated last year
- Mamba SSM architecture☆18,376Jun 2, 2026Updated last week
- Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with I…☆376Apr 23, 2024Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,425Nov 29, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Transformer model based on Kolmogorov–Arnold Network(KAN), which is an alternative of Multi-Layer Perceptron(MLP)☆29May 19, 2026Updated 3 weeks ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,523Jul 1, 2024Updated last year
- Understanding Kolmogorov-Arnold Networks: A Tutorial Series on KAN using Toy Examples☆205May 26, 2025Updated last year
- Kolmogorov-Arnold Networks (KAN) using Jacobi polynomials instead of B-splines.☆39May 9, 2024Updated 2 years ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆36May 8, 2024Updated 2 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆62Jun 1, 2025Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,325May 4, 2024Updated 2 years ago