jax-ml / jax-tpu-embedding
☆24 Updated last week
Alternatives and similar repositories for jax-tpu-embedding
Users who are interested in jax-tpu-embedding are comparing it to the libraries listed below.
- Experiment of using Tangent to autodiff triton ☆79 Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton ☆432 Updated 3 weeks ago
- ☆145 Updated this week
- A simple library for scaling up JAX programs ☆144 Updated this week
- ☆190 Updated 2 weeks ago
- Tokamax: A GPU and TPU kernel library. ☆102 Updated this week
- Implementation of Flash Attention in Jax ☆220 Updated last year
- JMP is a Mixed Precision library for JAX. ☆208 Updated 9 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆124 Updated last month
- Minimal yet performant LLM examples in pure JAX ☆193 Updated last month
- ☆116 Updated this week
- JAX Synergistic Memory Inspector ☆179 Updated last year
- ☆21 Updated 8 months ago
- ☆62 Updated 3 years ago
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA… ☆117 Updated this week
- Einsum-like high-level array sharding API for JAX ☆34 Updated last year
- seqax = sequence modeling + JAX ☆168 Updated 3 months ago
- JAX implementation of the Mistral 7b v0.2 model ☆34 Updated last year
- Machine Learning eXperiment Utilities ☆46 Updated 3 months ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox. ☆24 Updated last year
- JAX bindings for Flash Attention v2 ☆97 Updated this week
- JAX-Toolbox ☆359 Updated this week
- Named Tensors for Legible Deep Learning in JAX ☆211 Updated 3 weeks ago
- ☆53 Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆148 Updated 6 months ago
- If it quacks like a tensor... ☆59 Updated 11 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆305 Updated last week
- PyTorch centric eager mode debugger ☆48 Updated 10 months ago
- JAX implementation of the Llama 2 model ☆215 Updated last year
- Two implementations of ZeRO-1 optimizer sharding in JAX ☆14 Updated 2 years ago