Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
☆120Mar 17, 2026Updated this week
Alternatives and similar repositories for gpu-recipes
Users that are interested in gpu-recipes are comparing it to the libraries listed below
Sorting:
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆61Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆171Mar 13, 2026Updated last week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆327Updated this week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆416Jan 5, 2026Updated 2 months ago
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated 10 months ago
- ☆63Updated this week
- ☆12May 30, 2025Updated 9 months ago
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆830Jan 6, 2026Updated 2 months ago
- GCP PCI-DSS 3.2.1 InSpec Profile☆18May 26, 2021Updated 4 years ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆262Updated this week
- CUDA Template Functions☆20Dec 16, 2025Updated 3 months ago
- Automated Quality Control for Dialogflow CX Agents☆14May 3, 2024Updated last year
- AI/ML Recipes for Vertex AI, Serverless Spark and BigQuery open-source project is an effort to jumpstart your development of data process…☆70Updated this week
- ☆32Oct 31, 2025Updated 4 months ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- This repository contains the results and code for the MLPerf™ Inference v4.0 benchmark.☆11Jul 24, 2025Updated 7 months ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆327Jun 23, 2025Updated 8 months ago
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- ☆11Sep 6, 2024Updated last year
- ☆45Mar 13, 2026Updated last week
- Scripts and YAML files for the Bank of Anthos sample application☆23Dec 14, 2023Updated 2 years ago
- ☆35Nov 19, 2025Updated 4 months ago
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- ☆13Dec 6, 2022Updated 3 years ago
- DEPRECATED repo for Manning book Deep Learning with Structured Data - please see https://github.com/ryanmark1867/deep_learning_for_struct…☆12May 17, 2020Updated 5 years ago
- Google Cloud Product Cataloging Solution using Generative AI☆62Mar 14, 2026Updated last week
- A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…☆15Updated this week
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆65Mar 11, 2026Updated last week
- RAD Lab enables users to deploy infrastructure on Google Cloud Platform (GCP) to support specific use cases. Infrastructure is created an…☆112Feb 18, 2026Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Dec 18, 2025Updated 3 months ago
- A simple, performant and scalable Jax LLM!☆2,170Updated this week
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- ☆15Jan 10, 2025Updated last year
- Code labs for Vertex AI☆46Oct 14, 2021Updated 4 years ago
- This repo hosts code for vLLM CI & Performance Benchmark infrastructure.☆32Updated this week
- A direct Google Cloud Storage integration for PyTorch☆45Mar 12, 2026Updated last week
- A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.☆871Updated this week
- ☆10Jul 18, 2018Updated 7 years ago