Train a SmolLM-style LLM on FineWeb-Edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated 9 months ago
Alternatives and similar repositories for llm-jax
Users that are interested in llm-jax are comparing it to the libraries listed below.
- Automatically take good care of your preemptible TPUs ☆37 · May 15, 2023 · Updated 2 years ago
- Implementation of PSGD optimizer in JAX ☆35 · Dec 31, 2024 · Updated last year
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/… ☆34 · Mar 4, 2025 · Updated last year
- ☆22 · Jan 23, 2024 · Updated 2 years ago
- Efficient optimizers ☆321 · Updated this week
- JAX/Flax inference code for StarCoder ☆12 · Jun 12, 2023 · Updated 2 years ago
- Tiny AutoEncoder for Stable Diffusion Videos ☆36 · Oct 5, 2024 · Updated last year
- Supporting PyTorch FSDP for optimizers ☆84 · Dec 8, 2024 · Updated last year
- ☆23 · Jan 5, 2025 · Updated last year
- JAX/Flax rewrite of Karpathy's nanoGPT ☆65 · Feb 15, 2023 · Updated 3 years ago
- (EasyDel Former) is a utility library designed to simplify and enhance development in JAX ☆32 · Apr 29, 2026 · Updated last week
- ☆24 · Dec 16, 2024 · Updated last year
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024) ☆13 · Feb 13, 2025 · Updated last year
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆197 · Apr 3, 2026 · Updated last month
- Machine Learning eXperiment Utilities ☆48 · Jul 29, 2025 · Updated 9 months ago
- A set of Python scripts that makes your experience on TPU better ☆56 · Sep 18, 2025 · Updated 7 months ago
- ☆13 · Apr 25, 2024 · Updated 2 years ago
- KANs and MLPs ☆12 · Jun 7, 2024 · Updated last year
- A JAX-like function transformation engine, but micro: microjax ☆34 · Oct 25, 2024 · Updated last year