A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
☆165Nov 11, 2025Updated 6 months ago
Alternatives and similar repositories for kvax
Users that are interested in kvax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal yet performant LLM examples in pure JAX☆257Apr 10, 2026Updated last month
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- JAX bindings for Flash Attention v2☆106Feb 28, 2026Updated 2 months ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- seqax = sequence modeling + JAX☆191Jul 23, 2025Updated 10 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆460Apr 23, 2026Updated last month
- LoRA for arbitrary JAX models and functions☆144Feb 26, 2024Updated 2 years ago
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Named Tensors for Legible Deep Learning in JAX☆219Nov 8, 2025Updated 6 months ago
- Implementation of Flash Attention in Jax☆228Mar 1, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 7 months ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆708Jan 26, 2026Updated 4 months ago
- ☆16Jul 8, 2024Updated last year
- A snappy + easy + pretty TUI debugger for Python.☆69Apr 20, 2026Updated last month
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆94Jan 25, 2024Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 5 months ago
- ☆10Feb 20, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated 11 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- Run Slurm in Kubernetes☆384Updated this week
- Minimal, lightweight JAX implementations of popular models.☆235Mar 27, 2026Updated last month
- Pointax: PointMaze Environment for JAX☆28Oct 22, 2025Updated 7 months ago
- ☆355Apr 13, 2026Updated last month
- Tokamax: A GPU and TPU kernel library.☆220Updated this week
- An implementation of ESM2 in Equinox+JAX☆36Apr 20, 2026Updated last month
- ☆12Oct 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- JMP is a Mixed Precision library for JAX.☆213Jan 30, 2025Updated last year
- ☆10Apr 24, 2023Updated 3 years ago
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 9 months ago
- ESM2 protein language models in JAX/Flax☆19Oct 10, 2022Updated 3 years ago
- JAX-Toolbox☆411Updated this week
- ☆580Jul 11, 2024Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models"☆92Jan 9, 2026Updated 4 months ago