yixiaoer / einshardLinks
Einsum-like high-level array sharding API for JAX
☆34Updated last year
Alternatives and similar repositories for einshard
Users that are interested in einshard are comparing it to the libraries listed below
Sorting:
- A simple library for scaling up JAX programs☆144Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆34Updated last year
- Turn jitted jax functions back into python source code☆22Updated 10 months ago
- Minimal yet performant LLM examples in pure JAX☆187Updated last month
- If it quacks like a tensor...☆59Updated 11 months ago
- LoRA for arbitrary JAX models and functions☆141Updated last year
- JAX Arrays for human consumption☆109Updated last week
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- Minimal, lightweight JAX implementations of popular models.☆117Updated this week
- A functional training loops library for JAX☆88Updated last year
- Pytorch-like dataloaders for JAX.☆93Updated 5 months ago
- Multiple dispatch over abstract array types in JAX.☆134Updated 3 weeks ago
- Tidy autoregressive inference in JAX☆14Updated 2 months ago
- Named Tensors for Legible Deep Learning in JAX☆211Updated 2 weeks ago
- Visualize, create, and operate on pytrees in the most intuitive way possible.☆45Updated 9 months ago
- Tokamax: A GPU and TPU kernel library.☆95Updated this week
- JMP is a Mixed Precision library for JAX.☆209Updated 9 months ago
- Implementation of PSGD optimizer in JAX☆35Updated 10 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- ☆120Updated 4 months ago
- ☆44Updated 2 months ago
- Neural Networks for JAX☆84Updated last year
- ☆39Updated last year
- ☆116Updated this week
- Schedule free optimiser implemented in JAX using Optimistix☆15Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT☆62Updated 2 years ago
- ☆33Updated last year
- Experiment of using Tangent to autodiff triton☆80Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year