ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
☆546 Updated last year
Alternatives and similar repositories for tpu-starter
Users who are interested in tpu-starter are comparing it to the libraries listed below.
- JAX Synergistic Memory Inspector ☆179 Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆663 Updated this week
- JAX implementation of the Llama 2 model ☆219 Updated last year
- Puzzles for exploring transformers ☆371 Updated 2 years ago
- ☆362 Updated last year
- For optimization algorithm research and development. ☆538 Updated this week
- Implementation of a Transformer, but completely in Triton ☆274 Updated 3 years ago
- Annotated version of the Mamba paper ☆489 Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton ☆423 Updated 2 weeks ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆576 Updated last month
- Implementation of Flash Attention in Jax ☆217 Updated last year
- What would you do with 1000 H100s... ☆1,102 Updated last year
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆536 Updated 3 weeks ago
- Inference code for LLaMA models in JAX ☆120 Updated last year
- ☆535 Updated last year
- ☆330 Updated last week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆397 Updated this week
- ☆281 Updated last year
- ☆187 Updated 3 weeks ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX. ☆310 Updated last week
- Language Modeling with the H3 State Space Model ☆518 Updated last year
- ☆454 Updated 11 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆291 Updated last year
- ☆233 Updated 7 months ago
- JAX-Toolbox ☆337 Updated this week
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆586 Updated last week
- ☆171 Updated last year
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs. ☆382 Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆257 Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆298 Updated last week