CG80499 / KAN-GPT-2

Training small GPT-2 style models using Kolmogorov-Arnold networks.
108Updated 5 months ago

Related projects

Alternatives and complementary repositories for KAN-GPT-2