google / paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
☆496Updated 2 weeks ago
Alternatives and similar repositories for paxml
Users that are interested in paxml are comparing it to the libraries listed below
Sorting:
- ☆186Updated 2 weeks ago
- ☆301Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆569Updated this week
- Orbax provides common checkpointing and persistence utilities for JAX users☆380Updated this week
- jax-triton contains integrations between JAX and OpenAI Triton☆393Updated 2 weeks ago
- JAX-Toolbox☆302Updated this week
- ☆138Updated 2 weeks ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆325Updated this week
- Library for reading and processing ML training data.☆441Updated this week
- ☆455Updated 10 months ago
- seqax = sequence modeling + JAX☆155Updated last month
- ☆217Updated 10 months ago
- ☆353Updated last year
- CLU lets you write beautiful training loops in JAX.☆338Updated last month
- JAX implementation of the Llama 2 model☆218Updated last year
- Train very large language models in Jax.☆204Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆536Updated this week
- Inference code for LLaMA models in JAX☆118Updated 11 months ago
- ☆347Updated last week
- JMP is a Mixed Precision library for JAX.☆198Updated 3 months ago
- JAX Synergistic Memory Inspector☆173Updated 10 months ago
- Implementation of Flash Attention in Jax☆209Updated last year
- Named Tensors for Legible Deep Learning in JAX☆173Updated 2 weeks ago
- Implementation of a Transformer, but completely in Triton☆265Updated 3 years ago
- ☆226Updated 3 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆239Updated 2 years ago
- Pipeline Parallelism for PyTorch☆765Updated 8 months ago
- A library to analyze PyTorch traces.☆370Updated this week
- Large Context Attention☆710Updated 3 months ago
- Everything you want to know about Google Cloud TPU☆528Updated 10 months ago