openxla / tokamax
Tokamax: A GPU and TPU kernel library.
☆165 Updated this week
Alternatives and similar repositories for tokamax
Users who are interested in tokamax are comparing it to the libraries listed below.
- Minimal yet performant LLM examples in pure JAX ☆233 Updated 2 weeks ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆436 Updated last month
- JMP is a Mixed Precision library for JAX. ☆211 Updated 11 months ago
- A simple library for scaling up JAX programs ☆144 Updated 2 months ago
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA… ☆166 Updated last week
- JAX-Toolbox ☆377 Updated last week
- seqax = sequence modeling + JAX ☆170 Updated 6 months ago
- Named Tensors for Legible Deep Learning in JAX ☆217 Updated 2 months ago
- Orbax provides common checkpointing and persistence utilities for JAX users ☆478 Updated this week
- JAX Synergistic Memory Inspector ☆184 Updated last year
- ☆344 Updated 3 weeks ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆130 Updated last month
- Implementation of Flash Attention in JAX ☆224 Updated last year
- If it quacks like a tensor... ☆59 Updated last year
- OpTree: Optimized PyTree Utilities ☆205 Updated 3 weeks ago
- JAX bindings for Flash Attention v2 ☆103 Updated 3 weeks ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with JAX and Equinox. ☆24 Updated last year
- A JAX quantization library ☆85 Updated this week
- LoRA for arbitrary JAX models and functions ☆143 Updated last year
- ☆192 Updated last week
- A library for unit scaling in PyTorch ☆133 Updated 6 months ago
- Minimal, lightweight JAX implementations of popular models. ☆176 Updated last week
- 🧱 Modula software package ☆321 Updated 5 months ago
- Einsum-like high-level array sharding API for JAX ☆34 Updated last year
- Experiment of using Tangent to autodiff Triton ☆81 Updated 2 years ago
- ☆234 Updated 11 months ago
- Tensor Parallelism with JAX + Shard Map ☆11 Updated 2 years ago
- Accelerated First Order Parallel Associative Scan ☆194 Updated 3 weeks ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆327 Updated 3 weeks ago
- Minimal but scalable implementation of large language models in JAX ☆35 Updated 2 months ago