Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
☆133Jun 5, 2026Updated this week
Alternatives and similar repositories for gpu-recipes
Users that are interested in gpu-recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆104Updated this week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆64Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆183May 14, 2026Updated 3 weeks ago
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆100Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆343Jun 1, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆443Jan 5, 2026Updated 5 months ago
- ☆66Updated this week
- ☆13May 30, 2025Updated last year
- This repository contains base working directory for Codelab: Getting Started with Agent-to-Agent (A2A) Protocol: A Purchasing Concierge a…☆15Nov 18, 2025Updated 6 months ago
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆56Jan 21, 2026Updated 4 months ago
- GCP PCI-DSS 3.2.1 InSpec Profile☆18May 26, 2021Updated 5 years ago
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆837Jan 6, 2026Updated 5 months ago
- ☆16Mar 13, 2025Updated last year
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆328Jun 23, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- TPU inference for vLLM, with unified JAX and PyTorch support.☆348Updated this week
- Automated Quality Control for Dialogflow CX Agents☆14May 3, 2024Updated 2 years ago
- ☆49May 5, 2026Updated last month
- AI/ML Recipes for Vertex AI, Serverless Spark and BigQuery open-source project is an effort to jumpstart your development of data process…☆80May 15, 2026Updated 3 weeks ago
- ☆356Updated this week
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 6 months ago
- Android has APKs, Docker has OCIs - Edge now has Edge Containers☆11May 23, 2026Updated 2 weeks ago
- LINEBot☆13Apr 7, 2025Updated last year
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- ☆11Sep 6, 2024Updated last year
- ☆12Mar 16, 2026Updated 2 months ago
- ☆45May 15, 2026Updated 3 weeks ago
- Deploy Backup and DR appliances☆13Apr 10, 2026Updated last month
- Google Cloud Product Cataloging Solution using Generative AI☆71Updated this week
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…☆91May 23, 2026Updated 2 weeks ago
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- Optimized primitives for collective multi-GPU communication☆11May 8, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jul 30, 2025Updated 10 months ago
- Machine learning (ML) inference on Fastly's Compute@Edge☆16Jun 11, 2024Updated last year
- WG Serving☆37Mar 24, 2026Updated 2 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64May 5, 2026Updated last month
- Federated Learning on Google Cloud☆21May 11, 2026Updated 3 weeks ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆82Dec 18, 2025Updated 5 months ago
- Pubsub2Inbox is a versatile, multi-purpose tool to handle Pub/Sub messages and turn them into email, API calls, GCS objects, files or alm…☆47May 19, 2026Updated 2 weeks ago