Minimal yet performant LLM examples in pure JAX
☆251Apr 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for jax-llm-examples
Users who are interested in jax-llm-examples are comparing it to the libraries listed below.
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 5 months ago
- Minimal, lightweight JAX implementations of popular models.☆234Mar 27, 2026Updated last month
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 6 months ago
- Tokamax: A GPU and TPU kernel library.☆208Updated this week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆935Mar 15, 2026Updated last month
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆432Jan 5, 2026Updated 4 months ago
- Minimal JAX implementation of k-nearest neighbors using a k-d tree.☆57Jul 15, 2025Updated 9 months ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- ☆577Jul 11, 2024Updated last year
- A simple, performant and scalable Jax LLM!☆2,265Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆359Apr 27, 2026Updated last week
- ☆291Apr 27, 2026Updated last week
- JAX Synergistic Memory Inspector☆187Jul 16, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆704Jan 26, 2026Updated 3 months ago
- JAX backend for SGL☆268Updated this week
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆163Nov 11, 2025Updated 5 months ago
- Library for reading and processing ML training data.☆722Apr 30, 2026Updated last week
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- ☆16Jul 8, 2024Updated last year
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 7 months ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated 3 weeks ago
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- JAX - A curated list of resources https://github.com/google/jax☆2,102Jan 20, 2026Updated 3 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆450Apr 23, 2026Updated last week
- JAX-Toolbox☆405Updated this week
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆524Apr 16, 2026Updated 2 weeks ago
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated last year
- seqax = sequence modeling + JAX☆189Jul 23, 2025Updated 9 months ago
- A JAX-native High Performance Eval Metrics Library☆58Apr 13, 2026Updated 3 weeks ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,881Jun 22, 2025Updated 10 months ago
- ☆24Dec 16, 2024Updated last year
- a Jax quantization library☆113Updated this week
- JAX implementation of the Llama 2 model☆216Feb 2, 2024Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆94Jan 25, 2024Updated 2 years ago
- ☆31Updated this week
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Sep 24, 2025Updated 7 months ago
- ☆342Apr 30, 2026Updated last week