Training small GPT-2 style models using Kolmogorov-Arnold networks.
☆122 · May 25, 2024 · Updated 2 years ago
Alternatives and similar repositories for KAN-GPT-2
Users that are interested in KAN-GPT-2 are comparing it to the libraries listed below.
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling ☆724 · Nov 25, 2024 · Updated last year
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets, etc. ☆36 · May 8, 2024 · Updated 2 years ago
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial… ☆44 · Jun 24, 2024 · Updated last year
- An easy-to-use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆190 · Nov 24, 2024 · Updated last year
- A collection of reusable, high-performance, well-documented, thoroughly tested layers and models in Jax ☆24 · Jun 8, 2025 · Updated last year
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs). ☆11 · May 24, 2024 · Updated 2 years ago
- ☆19 · May 11, 2024 · Updated 2 years ago
- Code for experiments on transformers using Markovian data. ☆22 · Nov 22, 2024 · Updated last year
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess" ☆38 · Mar 3, 2025 · Updated last year
- ☆31 · May 5, 2024 · Updated 2 years ago
- Lion - EvoLved Sign Momentum w/ New Optimizer API in TensorFlow 2.11+ ☆10 · Feb 16, 2023 · Updated 3 years ago
- ☆21 · May 24, 2023 · Updated 3 years ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN) ☆485 · Jun 20, 2024 · Updated last year
- Lightning-like training API for JAX with Flax ☆46 · Dec 8, 2024 · Updated last year
- ☆11 · Aug 20, 2025 · Updated 9 months ago
- Kolmogorov-Arnold Networks (KAN) using orthogonal polynomials instead of B-splines. ☆40 · Nov 21, 2024 · Updated last year
- A comprehensive collection of KAN (Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor… ☆3,244 · Jun 1, 2026 · Updated last week
- ☆21 · Mar 1, 2023 · Updated 3 years ago
- ☆35 · Apr 12, 2024 · Updated 2 years ago
- ☆81 · Feb 4, 2025 · Updated last year
- ☆752 · May 24, 2024 · Updated 2 years ago
- ☆139 · May 8, 2024 · Updated 2 years ago
- Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments ☆296 · Apr 9, 2025 · Updated last year
- ECGDL: A framework for comparative study of databases and computational methods for arrhythmia detection from single-lead ECG ☆20 · Aug 31, 2023 · Updated 2 years ago
- Created Francisco Angulo de Lafuente ⚡️Deploy the DEMO⬇️ ☆30 · May 8, 2026 · Updated last month
- JMLR Cover Letter Template ☆10 · Dec 15, 2021 · Updated 4 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks ☆62 · Jun 1, 2025 · Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793 ☆458 · May 13, 2025 · Updated last year
- Implementation of Diffusion Transformers and Rectified Flow in Jax ☆27 · Jul 9, 2024 · Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 · Dec 21, 2021 · Updated 4 years ago
- ☆12 · Jun 28, 2021 · Updated 4 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆15 · Jun 28, 2025 · Updated 11 months ago
- JAX library for training sub-4B foundation models for edge ☆302 · Aug 28, 2024 · Updated last year
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). ☆4,646 · Aug 1, 2024 · Updated last year
- ☆54 · Sep 26, 2025 · Updated 8 months ago
- Variations of Kolmogorov-Arnold Networks ☆116 · May 15, 2024 · Updated 2 years ago
- Neural Networks for JAX ☆84 · Sep 24, 2024 · Updated last year
- ☆14 · Mar 31, 2024 · Updated 2 years ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆277 · Jul 15, 2024 · Updated last year