AI-Hypercomputer / cloud-accelerator-diagnostics
☆20Updated 4 months ago
Alternatives and similar repositories for cloud-accelerator-diagnostics:
Users who are interested in cloud-accelerator-diagnostics are comparing it to the libraries listed below:
- ☆132Updated 2 weeks ago
- A simple library for scaling up JAX programs☆129Updated 3 months ago
- JAX-Toolbox☆280Updated this week
- ☆183Updated last week
- ☆355Updated 7 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆51Updated this week
- JAX Synergistic Memory Inspector☆168Updated 6 months ago
- Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimenta…☆478Updated last week
- ☆284Updated this week
- seqax = sequence modeling + JAX☆142Updated 6 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerat…☆102Updated this week
- Experiment of using Tangent to autodiff triton☆75Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton☆378Updated 3 weeks ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆107Updated 2 weeks ago
- ☆337Updated 10 months ago
- Implementation of Flash Attention in Jax☆204Updated 11 months ago
- This repository contains the experimental PyTorch native float8 training UX☆221Updated 6 months ago
- Orbax provides common checkpointing and persistence utilities for JAX users☆333Updated this week
- ☆65Updated 2 years ago
- ☆88Updated 8 months ago
- Google TPU optimizations for transformers models☆96Updated 3 weeks ago
- PyTorch per step fault tolerance (actively under development)☆243Updated this week
- Named Tensors for Legible Deep Learning in JAX☆161Updated 3 weeks ago
- JAX implementation of the Mistral 7b v0.2 model☆35Updated 7 months ago
- ☆207Updated 7 months ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆221Updated this week
- extensible collectives library in triton☆82Updated 4 months ago
- A set of Python scripts that makes your experience on TPU better☆48Updated 7 months ago
- Cataloging released Triton kernels.☆164Updated last month
- ☆14Updated 7 months ago