Training small GPT-2 style models using Kolmogorov-Arnold networks.
☆123May 25, 2024Updated 2 years ago
Alternatives and similar repositories for KAN-GPT-2
Users that are interested in KAN-GPT-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆413May 13, 2024Updated 2 years ago
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling☆725Nov 25, 2024Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc☆36May 8, 2024Updated 2 years ago
- Your favourite classical machine learning algos on the GPU/TPU☆23Dec 14, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Jan 14, 2026Updated 5 months ago
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆24Jun 8, 2025Updated last year
- ☆10Oct 28, 2024Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 3 months ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆39Mar 3, 2025Updated last year
- ☆31May 5, 2024Updated 2 years ago
- Implementation for paper Automata Extraction from Transformers.☆12Jun 8, 2024Updated 2 years ago
- A library for visualizing and animating PDDL domains.☆15Sep 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Jun 2, 2024Updated 2 years ago
- ☆21May 24, 2023Updated 3 years ago
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆486Jun 20, 2024Updated 2 years ago
- Lightning-like training API for JAX with Flax☆45Dec 8, 2024Updated last year
- ☆11Aug 20, 2025Updated 10 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆20Jun 11, 2025Updated last year
- Kolmogorov-Arnold Networks (KAN) using orthogonal polynomials instead of B-splines.☆40Nov 21, 2024Updated last year
- A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and mor…☆3,248Jun 1, 2026Updated 3 weeks ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆21Nov 24, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆81Feb 4, 2025Updated last year
- ☆752May 24, 2024Updated 2 years ago
- ☆139May 8, 2024Updated 2 years ago
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆295Apr 9, 2025Updated last year
- Created Francisco Angulo de Lafuente ⚡️Deploy the DEMO⬇️☆30May 8, 2026Updated last month
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆62Jun 1, 2025Updated last year
- KAN for Vision Transformer☆256Oct 7, 2024Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆458May 13, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 4 years ago
- ☆28Feb 1, 2023Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ☆12Jun 28, 2021Updated 5 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated 2 years ago