A set of Python scripts that makes your experience on TPU better
☆56Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for tpux
Users that are interested in tpux are comparing it to the libraries listed below
Sorting:
- ☆16Jul 8, 2024Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆35Jul 3, 2024Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Sep 24, 2025Updated 5 months ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- JAX Synergistic Memory Inspector☆184Jul 16, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 3 months ago
- ☆26Updated this week
- Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project☆17Nov 9, 2023Updated 2 years ago
- ☆63Mar 4, 2022Updated 4 years ago
- ☆292Jul 15, 2024Updated last year
- ☆273Updated this week
- JAX implementation of the Llama 2 model☆216Feb 2, 2024Updated 2 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- ☆23Jul 11, 2025Updated 7 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU☆14Jul 22, 2023Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated last year
- ☆16Feb 24, 2026Updated last week
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- ☆13Feb 25, 2025Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- MLIR-based partitioning system☆167Updated this week
- If it quacks like a tensor...☆60Nov 13, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆16Oct 20, 2025Updated 4 months ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Jun 11, 2023Updated 2 years ago
- A Simple Statistical Distribution Library in JAX☆16Mar 30, 2024Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- JAX-Toolbox☆386Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,869Jun 22, 2025Updated 8 months ago
- ☆32Jul 2, 2025Updated 8 months ago
- Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom …☆25Jun 22, 2025Updated 8 months ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- DImensionality REduction in JAX☆25Nov 21, 2025Updated 3 months ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- jax-triton contains integrations between JAX and OpenAI Triton☆439Feb 27, 2026Updated last week