Minimal yet performant LLM examples in pure JAX
☆245Jan 14, 2026Updated 2 months ago
Alternatives and similar repositories for jax-llm-examples
Users that are interested in jax-llm-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 3 months ago
- Minimal, lightweight JAX implementations of popular models.☆214Mar 4, 2026Updated 3 weeks ago
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 4 months ago
- Tokamax: A GPU and TPU kernel library.☆185Mar 19, 2026Updated last week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆873Mar 15, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Minimal JAX implementation of k-nearest neighbors using a k-d tree.☆55Jul 15, 2025Updated 8 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆417Jan 5, 2026Updated 2 months ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- ☆571Jul 11, 2024Updated last year
- A simple, performant and scalable Jax LLM!☆2,182Updated this week
- ☆278Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆342Mar 20, 2026Updated last week
- JAX backend for SGL☆252Updated this week
- JAX Synergistic Memory Inspector☆185Jul 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆699Jan 26, 2026Updated 2 months ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆159Nov 11, 2025Updated 4 months ago
- Library for reading and processing ML training data.☆694Mar 20, 2026Updated last week
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- ☆16Jul 8, 2024Updated last year
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 6 months ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Nov 22, 2022Updated 3 years ago
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- JAX - A curated list of resources https://github.com/google/jax☆2,078Jan 20, 2026Updated 2 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆442Mar 13, 2026Updated 2 weeks ago
- JAX-Toolbox☆392Updated this week
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆522Mar 19, 2026Updated last week
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated last year
- seqax = sequence modeling + JAX☆187Jul 23, 2025Updated 8 months ago
- a Jax quantization library☆102Updated this week
- A JAX-native High Performance Eval Metrics Library☆58Feb 3, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,872Jun 22, 2025Updated 9 months ago
- ☆26Updated this week
- ☆24Dec 16, 2024Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- JAX implementation of the Llama 2 model☆216Feb 2, 2024Updated 2 years ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆266Updated this week
- ☆318Mar 20, 2026Updated last week