jenkspt/gpt-jax

jenkspt / gpt-jax

Jax/Flax rewrite of Karpathy's nanoGPT

☆65

Alternatives and similar repositories for gpt-jax

Users who are interested in gpt-jax are comparing it to the libraries listed below.

cgarciae / nanoGPT-jax
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆38 · Dec 3, 2023 · Updated 2 years ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated 11 months ago
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆34 · Nov 28, 2025 · Updated 7 months ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆46 · Sep 17, 2022 · Updated 3 years ago
ml-gde / jaxgarden
A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax
☆23 · Jun 8, 2025 · Updated last year
brianfitzgerald / jax-mmdit
Implementation of Diffusion Transformers and Rectified Flow in Jax
☆27 · Jul 9, 2024 · Updated 2 years ago
cgarciae / nnx
Neural Networks for JAX
☆84 · Sep 24, 2024 · Updated last year
kelechi-c / mini_DiT
minimal diffusion transformer in pytorch.
☆17 · Oct 6, 2024 · Updated last year
TheodoreWolf / hyperoptax
Parallel hyperparameter tuning with JAX
☆39 · May 21, 2026 · Updated last month
kelechi-c / dit_flow
DiT (training + flow matching) in Jax
☆12 · Jan 5, 2025 · Updated last year
ml-gde / jflux
JAX Implementation of Black Forest Labs' Flux.1 family of models
☆40 · Jun 18, 2026 · Updated 3 weeks ago
cgarciae / jax_metrics
A metrics library for the JAX ecosystem
☆41 · Mar 16, 2023 · Updated 3 years ago
jax-ml / jax-ai-stack
☆303 · Jun 29, 2026 · Updated last week
bryanoliveira / sliding-puzzles-gym
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17 · Jun 23, 2025 · Updated last year
DSLwDE / DSLwDE
☆14 · Jul 25, 2025 · Updated 11 months ago
cgarciae / ciclo
A functional training loops library for JAX
☆88 · Feb 13, 2024 · Updated 2 years ago
haydn-jones / SOAP_JAX
Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321)
☆27 · Updated this week
MathIsAll / ZO-AdaMU
This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…
☆14 · Dec 12, 2023 · Updated 2 years ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆87 · Mar 14, 2022 · Updated 4 years ago
yixiaoer / einshard
Einsum-like high-level array sharding API for JAX
☆35 · Jul 16, 2024 · Updated last year
lkwq007 / flux-flax
JAX port of FLUX.1 models using flax.nnx
☆23 · Sep 28, 2024 · Updated last year
roger-creus / stable-deep-rl-at-scale
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39 · Oct 24, 2025 · Updated 8 months ago
dlwh / jax_sourceror
Turn jitted jax functions back into python source code
☆23 · Dec 16, 2024 · Updated last year
GallagherCommaJack / modulax
☆18 · Aug 24, 2024 · Updated last year
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆61 · Oct 23, 2023 · Updated 2 years ago
dibyaghosh / jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
☆100 · Jul 1, 2023 · Updated 3 years ago
google / CommonLoopUtils
CLU lets you write beautiful training loops in JAX.
☆368 · Updated this week
fattorib / ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
☆14 · Jun 11, 2023 · Updated 3 years ago
robintyh1 / icml2021-pengqlambda
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15 · Jul 23, 2021 · Updated 4 years ago
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆162 · Jun 25, 2022 · Updated 4 years ago
modelbased / minirllab
Mini RL Lab
☆16 · Jun 17, 2024 · Updated 2 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆58 · Nov 8, 2022 · Updated 3 years ago
Itomigna2 / Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆20 · Mar 18, 2024 · Updated 2 years ago
rwitten / HighPerfLLMs2024
☆590 · Jul 11, 2024 · Updated last year
google-deepmind / nanodo
☆304 · Jul 15, 2024 · Updated last year
google / grain
Library for reading and processing ML training data.
☆747 · Jul 2, 2026 · Updated last week
mgrankin / minGPT
minGPT in JAX
☆49 · Jan 10, 2022 · Updated 4 years ago
erfanzar / jax-flash-attn2
A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…
☆34 · Mar 4, 2025 · Updated last year
vpj / jax_transformer
Autoregressive transformer in JAX from scratch
☆23 · Jan 28, 2022 · Updated 4 years ago