Testing framework for deep learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
☆65Mar 11, 2026Updated last week
Alternatives and similar repositories for ml-testing-accelerators
Users who are interested in ml-testing-accelerators compare it to the libraries listed below.
- ☆16Mar 13, 2025Updated last year
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆61Updated this week
- ☆16Feb 18, 2026Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆79Dec 18, 2025Updated 3 months ago
- ☆194Mar 10, 2026Updated last week
- ☆27Updated this week
- torchprime is a reference model implementation for PyTorch on TPU.☆46Mar 3, 2026Updated 2 weeks ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,756Dec 18, 2025Updated 3 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆551Updated this week
- ☆13Mar 14, 2026Updated last week
- ☆21Mar 3, 2025Updated last year
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 9 months ago
- Google TPU optimizations for transformers models☆136Jan 23, 2026Updated last month
- ☆22Dec 14, 2021Updated 4 years ago
- ☆26Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated 10 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆416Jan 5, 2026Updated 2 months ago
- K(ickstart)r for Typesafe projects☆11Oct 24, 2015Updated 10 years ago
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆55Updated this week
- ☆66Aug 2, 2022Updated 3 years ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆262Updated this week
- ☆150Feb 26, 2026Updated 3 weeks ago
- Automatically load data from Google Cloud Storage files into Big Query tables☆11Dec 30, 2022Updated 3 years ago
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPUs by running a BLAS matrix multiply using different data types…☆120Jul 8, 2025Updated 8 months ago
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆120Updated this week
- Prototype routines for GPU quantization written using PyTorch.☆21Feb 8, 2026Updated last month
- ☆20Feb 17, 2021Updated 5 years ago
- Orbax provides common checkpointing and persistence utilities for JAX users☆489Mar 14, 2026Updated last week
- Fast, efficient code to pull non-null categorical data out, encode it and impute nulls with KNN Impute from fancyimpute library☆17Dec 8, 2019Updated 6 years ago
- ☆570Jul 11, 2024Updated last year
- PyTorch bindings for CUTLASS grouped GEMM.☆146May 29, 2025Updated 9 months ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- ☆136Mar 6, 2026Updated 2 weeks ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆487Updated this week
- TensorFlow 2 training code with JIT compilation on multi-GPU.☆17Jan 28, 2021Updated 5 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆250Mar 13, 2026Updated last week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Pipeline parallelism for the minimalist☆40Aug 6, 2025Updated 7 months ago