google / paxmlLinks

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

☆513

Alternatives and similar repositories for paxml

Users that are interested in paxml are comparing it to the libraries listed below

Sorting:

google / praxis
☆186Updated last month
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆405Updated 3 weeks ago
stanford-crfm / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆607Updated this week
google / saxml
☆142Updated last week
google / orbax
Orbax provides common checkpointing and persistence utilities for JAX users
☆404Updated this week
NVIDIA / JAX-Toolbox
JAX-Toolbox
☆321Updated this week
google / aqt
☆320Updated 2 weeks ago
google / flaxformer
☆358Updated last year
rwitten / HighPerfLLMs2024
☆511Updated last year
google / grain
Library for reading and processing ML training data.
☆474Updated this week
google-deepmind / nanodo
☆273Updated last year
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆354Updated last month
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆219Updated last year
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆175Updated last year
MatX-inc / seqax
seqax = sequence modeling + JAX
☆165Updated last month
lucidrains / flash-attention-jax
Implementation of Flash Attention in Jax
☆213Updated last year
google / CommonLoopUtils
CLU lets you write beautiful training loops in JAX.
☆349Updated 3 weeks ago
jax-ml / ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
☆280Updated this week
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆204Updated last year
google / fiddle
☆350Updated last week
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆388Updated this week
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆239Updated 2 years ago
jax-ml / scaling-book
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
☆424Updated last week
google-research / jaxpruner
☆230Updated 5 months ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆118Updated last year
pytorch-labs / monarch
PyTorch Single Controller
☆318Updated this week
google-deepmind / jmp
JMP is a Mixed Precision library for JAX.
☆206Updated 5 months ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆370Updated 2 weeks ago
ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
☆532Updated last year
facebookresearch / optimizers
For optimization algorithm research and development.
☆521Updated this week