AI on GKE is a collection of examples, best practices, and prebuilt solutions to help build, deploy, and scale AI platforms on Google Kubernetes Engine.
☆327 · Jun 23, 2025 · Updated 8 months ago
Alternatives and similar repositories for ai-on-gke
Users interested in ai-on-gke are comparing it to the repositories listed below:
- ☆48 · Jan 5, 2026 · Updated 2 months ago
- ☆79 · Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud. ☆118 · Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers to orchestrate training jobs on accelerat… ☆171 · Updated this week
- GenAI inference performance benchmarking tool ☆151 · Feb 27, 2026 · Updated last week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative… ☆655 · Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆676 · Mar 3, 2026 · Updated last week
- Create a secure ML environment on Vertex AI ☆35 · Updated this week
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog… ☆55 · Jan 21, 2026 · Updated last month
- ☆11 · Jul 9, 2024 · Updated last year
- ☆12 · Jun 11, 2024 · Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel… ☆414 · Jan 5, 2026 · Updated 2 months ago
- ☆17 · Feb 20, 2025 · Updated last year
- Sample applications for Google Kubernetes Engine (GKE) ☆1,344 · Updated this week
- ☆63 · Feb 12, 2026 · Updated 3 weeks ago
- Mono repo for open-sourcing Cloud Solutions Architects projects ☆106 · Updated this week
- AppWrapper controller for Kueue ☆17 · Mar 1, 2026 · Updated last week
- ☆15 · Jan 26, 2021 · Updated 5 years ago
- ☆325 · Mar 25, 2025 · Updated 11 months ago
- End-to-end modular samples and landing zones toolkit for Terraform on GCP. ☆1,957 · Updated this week
- Finetune LLMs on K8s by using Runbooks ☆170 · Aug 28, 2024 · Updated last year
- cert-manager issuer for Google CA Service ☆93 · Mar 3, 2026 · Updated last week
- An end-to-end operating model for onboarding and continually deploying services with Anthos. ☆85 · May 3, 2024 · Updated last year
- Packaged configuration for setting up a Kubernetes cluster with Anthos Service Mesh features enabled ☆142 · Feb 20, 2026 · Updated 2 weeks ago
- Kubernetes-native Job Queueing ☆2,347 · Updated this week
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI ☆12,770 · Mar 1, 2026 · Updated last week
- This repository hosts the instructions and workshop materials for Lab 331 (Deep Research with Langchain and DeepSeek R1) for Microsoft Bu… ☆26 · Jun 5, 2025 · Updated 9 months ago
- ☆39 · Updated this week
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine ☆248 · Updated this week
- ☆35 · Nov 19, 2025 · Updated 3 months ago
- ☆35 · Jan 23, 2026 · Updated last month
- In this project, you will leverage Kubernetes Engine and Google Compute Engine to explore how Istio can manage services that reside outsi… ☆55 · Dec 14, 2023 · Updated 2 years ago
- Cloud Native Benchmarking of Foundation Models ☆45 · Jul 31, 2025 · Updated 7 months ago
- A Helm Chart with pre-configured tools for your Container Engine clusters ☆19 · Jan 5, 2018 · Updated 8 years ago
- ☆41 · Mar 2, 2026 · Updated last week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te… ☆1,158 · Feb 23, 2026 · Updated 2 weeks ago
- Istio demos and sample applications for GCP ☆340 · Aug 15, 2023 · Updated 2 years ago
- Fine tune an LLM model to answer questions from your documents. ☆155 · Feb 24, 2026 · Updated 2 weeks ago
- ☆32 · Feb 4, 2026 · Updated last month