Training small GPT-2 style models using Kolmogorov-Arnold networks.
☆123May 25, 2024Updated last year
Alternatives and similar repositories for KAN-GPT-2
Users that are interested in KAN-GPT-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆414May 13, 2024Updated 2 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆37May 8, 2024Updated 2 years ago
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 4 months ago
- Convolutional layer for Kolmogorov-Arnold Network (KAN)☆119Mar 25, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Jan 14, 2026Updated 3 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆190Nov 24, 2024Updated last year
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆23Jun 8, 2025Updated 11 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆37Mar 3, 2025Updated last year
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated last year
- ☆31May 5, 2024Updated 2 years ago
- A library for visualizing and animating PDDL domains.☆15Sep 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jun 2, 2024Updated last year
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆479Jun 20, 2024Updated last year
- ☆10Aug 20, 2025Updated 8 months ago
- Kolmogorov-Arnold Networks (KAN) using orthogonal polynomials instead of B-splines.☆40Nov 21, 2024Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,222Apr 2, 2026Updated last month
- ☆21Mar 1, 2023Updated 3 years ago
- ☆35Apr 12, 2024Updated 2 years ago
- ☆79Feb 4, 2025Updated last year
- ☆140May 8, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆750May 24, 2024Updated last year
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆298Apr 9, 2025Updated last year
- ☆14Apr 18, 2025Updated last year
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆63Jun 1, 2025Updated 11 months ago
- KAN for Vision Transformer☆257Oct 7, 2024Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆458May 13, 2025Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ☆12Jun 28, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated 2 years ago
- Additional multi-backend functionality for Keras 3.☆16Mar 1, 2024Updated 2 years ago
- SOTA model implementations in JAX/FLAX☆301Aug 28, 2024Updated last year
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,635Aug 1, 2024Updated last year
- ☆54Sep 26, 2025Updated 7 months ago
- Neural Networks for JAX☆84Sep 24, 2024Updated last year
- ☆14Mar 31, 2024Updated 2 years ago