A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
☆163 · Nov 11, 2025 · Updated 5 months ago
Alternatives and similar repositories for kvax
Users that are interested in kvax are comparing it to the libraries listed below.
- Minimal yet performant LLM examples in pure JAX ☆251 · Apr 10, 2026 · Updated 3 weeks ago
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/… ☆34 · Mar 4, 2025 · Updated last year
- FLOPS counter for all your GPU benchmarking needs ☆13 · Aug 8, 2024 · Updated last year
- Tensor Parallelism with JAX + Shard Map ☆11 · Sep 29, 2023 · Updated 2 years ago
- Einsum-like high-level array sharding API for JAX ☆34 · Jul 16, 2024 · Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox. ☆25 · Sep 29, 2024 · Updated last year
- seqax = sequence modeling + JAX ☆189 · Jul 23, 2025 · Updated 9 months ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆450 · Apr 23, 2026 · Updated last week
- LoRA for arbitrary JAX models and functions ☆144 · Feb 26, 2024 · Updated 2 years ago
- Jax like function transformation engine but micro, microjax ☆34 · Oct 25, 2024 · Updated last year
- Tidy autoregressive inference in JAX ☆15 · Sep 1, 2025 · Updated 8 months ago
- Turn jitted jax functions back into python source code ☆23 · Dec 16, 2024 · Updated last year
- Named Tensors for Legible Deep Learning in JAX ☆219 · Nov 8, 2025 · Updated 5 months ago
- Implementation of Flash Attention in Jax ☆228 · Mar 1, 2024 · Updated 2 years ago
- Train very large language models in Jax. ☆209 · Oct 21, 2023 · Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines ☆12 · Oct 10, 2025 · Updated 6 months ago
- A JAX implementation of stochastic addition. ☆14 · Aug 15, 2022 · Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆704 · Jan 26, 2026 · Updated 3 months ago
- ☆16 · Jul 8, 2024 · Updated last year
- A snappy + easy + pretty TUI debugger for Python. ☆69 · Apr 20, 2026 · Updated 2 weeks ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆94 · Jan 25, 2024 · Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX ☆35 · Nov 28, 2025 · Updated 5 months ago
- ☆10 · Feb 20, 2024 · Updated 2 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆16 · May 28, 2025 · Updated 11 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 9 months ago
- Minimal, lightweight JAX implementations of popular models. ☆234 · Mar 27, 2026 · Updated last month
- Pointax: PointMaze Environment for JAX ☆28 · Oct 22, 2025 · Updated 6 months ago
- ☆354 · Apr 13, 2026 · Updated 3 weeks ago
- Tokamax: A GPU and TPU kernel library. ☆208 · Updated this week
- An implementation of ESM2 in Equinox+JAX ☆36 · Apr 20, 2026 · Updated 2 weeks ago
- ☆12 · Oct 10, 2023 · Updated 2 years ago
- JMP is a Mixed Precision library for JAX. ☆212 · Jan 30, 2025 · Updated last year
- ☆10 · Apr 24, 2023 · Updated 3 years ago
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 8 months ago
- Official code release for "SuperBPE: Space Travel for Language Models" ☆91 · Jan 9, 2026 · Updated 3 months ago
- JAX-Toolbox ☆405 · Updated this week
- ESM2 protein language models in JAX/Flax ☆19 · Oct 10, 2022 · Updated 3 years ago
- ☆577 · Jul 11, 2024 · Updated last year
- Second Order Optimization and Curvature Estimation with K-FAC in JAX. ☆323 · Apr 19, 2026 · Updated 2 weeks ago