GoogleCloudPlatform / ai-on-gkeView external linksLinks
AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
☆327Jun 23, 2025Updated 7 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below
Sorting:
- ☆48Jan 5, 2026Updated last month
- ☆77Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆115Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆169Feb 10, 2026Updated last week
- GenAI inference performance benchmarking tool☆145Feb 6, 2026Updated last week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative…☆629Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆669Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,319Feb 9, 2026Updated last week
- Gateway API Inference Extension☆583Updated this week
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆78May 1, 2024Updated last year
- ☆11Jul 9, 2024Updated last year
- ☆12Jun 11, 2024Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆407Jan 5, 2026Updated last month
- ☆17Feb 20, 2025Updated 11 months ago
- Sample applications for Google Kubernetes Engine (GKE)☆1,338Updated this week
- ☆63Updated this week
- Mono repo for open-sourcing Cloud Solutions Architects projects☆101Updated this week
- ☆15Jan 26, 2021Updated 5 years ago
- ☆324Mar 25, 2025Updated 10 months ago
- End-to-end modular samples and landing zones toolkit for Terraform on GCP.☆1,941Updated this week
- cert-manager issuer for Google CA Service☆92Updated this week
- WG Serving☆34Dec 15, 2025Updated 2 months ago
- Packaged configuration for setting up a Kubernetes cluster with Anthos Service Mesh features enabled☆142Updated this week
- Kubernetes-native Job Queueing☆2,313Updated this week
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI☆12,657Updated this week
- This repository hosts the instructions and workshop materials for Lab 331 (Deep Research with Langchain and DeepSeek R1) for Microsoft Bu…☆26Jun 5, 2025Updated 8 months ago
- ☆38Feb 3, 2026Updated last week
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆247Feb 6, 2026Updated last week
- ☆35Nov 19, 2025Updated 2 months ago
- ☆22Jan 6, 2026Updated last month
- ☆39Updated this week
- ☆34Jan 23, 2026Updated 3 weeks ago
- In this project, you will leverage Kubernetes Engine and Google Compute Engine to explore how Istio can manage services that reside outsi…☆55Dec 14, 2023Updated 2 years ago
- ☆37Updated this week
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 6 months ago
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,152Updated this week
- Istio demos and sample applications for GCP☆340Aug 15, 2023Updated 2 years ago
- Fine tune an LLM model to answer questions from your documents.☆149Dec 11, 2025Updated 2 months ago