A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
☆161Nov 11, 2025Updated 5 months ago
Alternatives and similar repositories for kvax
Users that are interested in kvax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal yet performant LLM examples in pure JAX☆246Updated this week
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- JAX bindings for Flash Attention v2☆104Feb 28, 2026Updated last month
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- seqax = sequence modeling + JAX☆188Jul 23, 2025Updated 8 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆444Mar 26, 2026Updated 3 weeks ago
- LoRA for arbitrary JAX models and functions☆145Feb 26, 2024Updated 2 years ago
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 7 months ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Named Tensors for Legible Deep Learning in JAX☆216Nov 8, 2025Updated 5 months ago
- Implementation of Flash Attention in Jax☆228Mar 1, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 6 months ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆703Jan 26, 2026Updated 2 months ago
- ☆16Jul 8, 2024Updated last year
- A snappy + easy + pretty TUI debugger for Python.☆69Oct 3, 2025Updated 6 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 4 months ago
- ☆10Feb 20, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆15May 28, 2025Updated 10 months ago
- Pointax: PointMaze Environment for JAX☆27Oct 22, 2025Updated 5 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- Minimal, lightweight JAX implementations of popular models.☆228Mar 27, 2026Updated 2 weeks ago
- ☆353Apr 9, 2026Updated last week
- Tokamax: A GPU and TPU kernel library.☆198Updated this week
- An implementation of ESM2 in Equinox+JAX☆36Jun 5, 2025Updated 10 months ago
- ☆12Oct 10, 2023Updated 2 years ago
- JMP is a Mixed Precision library for JAX.☆212Jan 30, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Apr 24, 2023Updated 2 years ago
- ☆29Updated this week
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 8 months ago
- Official code release for "SuperBPE: Space Travel for Language Models"☆91Jan 9, 2026Updated 3 months ago
- JAX-Toolbox☆401Updated this week
- ESM2 protein language models in JAX/Flax☆19Oct 10, 2022Updated 3 years ago
- ☆574Jul 11, 2024Updated last year