AI-Hypercomputer / ml-goodput-measurementLinks
☆27Updated this week
Alternatives and similar repositories for ml-goodput-measurement
Users that are interested in ml-goodput-measurement are comparing it to the libraries listed below
Sorting:
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆159Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆109Updated this week
- ☆151Updated last week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆57Updated this week
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated last week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆400Updated last week
- ☆69Updated last week
- ☆16Updated 7 months ago
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆36Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Updated last month
- ☆73Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆307Updated this week
- ☆192Updated this week
- ☆59Updated this week
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated last year
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆105Updated this week
- a Jax quantization library☆84Updated this week
- ☆133Updated 3 weeks ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆213Updated this week
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆327Updated 6 months ago
- ☆16Updated 10 months ago
- Module, Model, and Tensor Serialization/Deserialization☆283Updated 4 months ago
- ☆342Updated last week
- ☆296Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆544Updated this week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆55Updated 3 weeks ago
- Composable metric reporters in Python.☆14Updated last year
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Updated last month
- ☆81Updated this week
- Tokamax: A GPU and TPU kernel library.☆158Updated this week