Training small GPT-2 style models using Kolmogorov-Arnold networks.
☆122May 25, 2024Updated last year
Alternatives and similar repositories for KAN-GPT-2
Users that are interested in KAN-GPT-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling☆725Nov 25, 2024Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 3 months ago
- Convolutional layer for Kolmogorov-Arnold Network (KAN)☆117Mar 25, 2025Updated last year
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Jun 24, 2024Updated last year
- ☆10Oct 28, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- ☆32May 5, 2024Updated last year
- ☆13Jun 2, 2024Updated last year
- ☆21May 24, 2023Updated 2 years ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆472Jun 20, 2024Updated last year
- Lightning-like training API for JAX with Flax☆45Dec 8, 2024Updated last year
- ☆11Aug 20, 2025Updated 7 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 9 months ago
- Kolmogorov-Arnold Networks (KAN) using orthogonal polynomials instead of B-splines.☆40Nov 21, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,201Dec 14, 2025Updated 3 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated last year
- Example of how to use R in Jupyter notebooks and make compatible with Binder☆17Feb 25, 2019Updated 7 years ago
- ☆21Mar 1, 2023Updated 3 years ago
- Chrome extension that redacts potentially sensitive information before querying ChatGPT☆12Aug 10, 2023Updated 2 years ago
- ☆35Apr 12, 2024Updated last year
- ☆79Feb 4, 2025Updated last year
- ☆140May 8, 2024Updated last year
- ☆749May 24, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆297Apr 9, 2025Updated 11 months ago
- Created Francisco Angulo de Lafuente ⚡️Deploy the DEMO⬇️☆23Mar 22, 2026Updated last week
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆453May 13, 2025Updated 10 months ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 3 years ago
- ☆27Feb 1, 2023Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ☆13Jun 28, 2021Updated 4 years ago
- Additional multi-backend functionality for Keras 3.☆16Mar 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Train your own sub-1B foundation models JAX/GCP/TPUS in hours☆302Aug 28, 2024Updated last year
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).☆4,602Aug 1, 2024Updated last year
- ☆51Sep 26, 2025Updated 6 months ago
- Variations of Kolmogorov-Arnold Networks☆116May 15, 2024Updated last year
- ☆14Mar 31, 2024Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆277Jul 15, 2024Updated last year
- Implementation for MatMul-free LM.☆3,059Dec 2, 2025Updated 3 months ago