ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
☆506 · Updated 6 months ago
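Since tpu-starter is oriented toward running JAX on Cloud TPU VMs, a quick sanity check is to confirm that JAX actually sees the TPU cores. This is a minimal sketch, assuming a Cloud TPU VM with the TPU build of JAX installed (e.g. `pip install "jax[tpu]" -f https://storage.googleapis.com/jax-releases/libtpu_releases.html`):

```python
# Minimal sketch, assuming a Cloud TPU VM with the TPU build of JAX installed.
import jax

# List the accelerator devices JAX can see; on a v3-8 or v4-8 TPU VM this
# should print eight TpuDevice entries.
print(jax.devices())

# Check that the default backend is TPU rather than a CPU fallback.
print(jax.default_backend())  # expected: "tpu"
```

If the backend reports `cpu` instead of `tpu`, JAX is not finding the TPU runtime and the install step above likely needs to be redone.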
Alternatives and similar repositories for tpu-starter:
Users interested in tpu-starter are comparing it to the libraries listed below.
- For optimization algorithm research and development. ☆486 · Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆536 · Updated this week
- JAX Synergistic Memory Inspector ☆165 · Updated 6 months ago
- Puzzles for exploring transformers ☆331 · Updated last year
- Helpful tools and examples for working with flex-attention ☆603 · Updated this week
- JAX implementation of the Llama 2 model ☆213 · Updated 11 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆511 · Updated this week
- What would you do with 1000 H100s... ☆970 · Updated last year
- ☆413 · Updated 3 months ago
- Annotated version of the Mamba paper ☆470 · Updated 11 months ago
- ☆203 · Updated 6 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements… ☆351 · Updated this week
- ☆336 · Updated 9 months ago
- ☆296 · Updated 7 months ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆371 · Updated last week
- Named tensors with first-class dimensions for PyTorch ☆321 · Updated last year
- Inference code for LLaMA models in JAX ☆114 · Updated 8 months ago
- ☆278 · Updated last week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,019 · Updated 9 months ago
- Language Modeling with the H3 State Space Model ☆515 · Updated last year
- Implementation of Flash Attention in Jax ☆204 · Updated 10 months ago
- JAX-Toolbox ☆279 · Updated this week
- Puzzles for learning Triton ☆1,337 · Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆670 · Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation… ☆475 · Updated last week
- Implementation of a Transformer, but completely in Triton ☆253 · Updated 2 years ago
- maximal update parametrization (µP) ☆1,438 · Updated 6 months ago
- Effortless plug-and-play optimizer to cut model training costs by 50%. A new optimizer that is 2x faster than Adam on LLMs. ☆378 · Updated 7 months ago
- CLU lets you write beautiful training loops in JAX. ☆329 · Updated this week
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆945 · Updated last year