speed1313 / jax-llm

JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dataset.
12Updated 7 months ago

Alternatives and similar repositories for jax-llm:

Users that are interested in jax-llm are comparing it to the libraries listed below