A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
☆167Nov 11, 2025Updated 7 months ago
Alternatives and similar repositories for kvax
Users that are interested in kvax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimal yet performant LLM examples in pure JAX☆256Apr 10, 2026Updated 2 months ago
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- JAX bindings for Flash Attention v2☆106Feb 28, 2026Updated 3 months ago
- Tensor Parallelism with JAX + Shard Map☆11Sep 29, 2023Updated 2 years ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆26Sep 29, 2024Updated last year
- seqax = sequence modeling + JAX☆192Jul 23, 2025Updated 10 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆462Jun 1, 2026Updated 2 weeks ago
- LoRA for arbitrary JAX models and functions☆144Feb 26, 2024Updated 2 years ago
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 9 months ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Implementation of Flash Attention in Jax☆227Mar 1, 2024Updated 2 years ago
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 8 months ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆708Jan 26, 2026Updated 4 months ago
- ☆16Jul 8, 2024Updated last year
- A snappy + easy + pretty TUI debugger for Python.☆70May 22, 2026Updated 3 weeks ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆94Jan 25, 2024Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆34Nov 28, 2025Updated 6 months ago
- ☆10Feb 20, 2024Updated 2 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- Pointax: PointMaze Environment for JAX☆28Oct 22, 2025Updated 7 months ago
- ☆355Apr 13, 2026Updated 2 months ago
- Tokamax: A GPU and TPU kernel library.☆227Updated this week
- An implementation of ESM2 in Equinox+JAX☆36Apr 20, 2026Updated last month
- ☆12Oct 10, 2023Updated 2 years ago
- JMP is a Mixed Precision library for JAX.☆214Jan 30, 2025Updated last year
- ☆10Apr 24, 2023Updated 3 years ago
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ESM2 protein language models in JAX/Flax☆19Oct 10, 2022Updated 3 years ago
- JAX-Toolbox☆412Jun 9, 2026Updated last week
- ☆583Jul 11, 2024Updated last year
- Official code release for "SuperBPE: Space Travel for Language Models"☆93May 28, 2026Updated 2 weeks ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX.☆324Updated this week
- ☆34Updated this week
- Frechet inception distance (FID) evaluation in JAX☆14May 28, 2024Updated 2 years ago