AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
☆327Jun 23, 2025Updated 9 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆49Jan 5, 2026Updated 2 months ago
- ☆12Jun 11, 2024Updated last year
- GenAI inference performance benchmarking tool☆156Mar 16, 2026Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆172Updated this week
- ☆15Mar 18, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆11Jul 9, 2024Updated last year
- Gateway API Inference Extension☆616Mar 22, 2026Updated last week
- A toolkit to run Ray applications on Kubernetes☆2,408Updated this week
- WG Serving☆34Mar 5, 2026Updated 3 weeks ago
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated 11 months ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆686Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆327Updated this week
- ☆33Feb 4, 2026Updated last month
- An end-to-end operating model for onboarding and continually deploying services with Anthos.☆85May 3, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆39Mar 23, 2026Updated last week
- Create a secure ML environment on Vertex AI☆37Mar 19, 2026Updated last week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative…☆677Updated this week
- Kubernetes-native Job Queueing☆2,399Updated this week
- Finetune LLMs on K8s by using Runbooks☆170Aug 28, 2024Updated last year
- ☆327Mar 25, 2025Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆417Jan 5, 2026Updated 2 months ago
- AppWrapper controller for Kueue☆17Mar 20, 2026Updated last week
- ☆18Mar 11, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Apr 15, 2025Updated 11 months ago
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆831Jan 6, 2026Updated 2 months ago
- llm-d benchmark scripts and tooling☆51Mar 22, 2026Updated last week
- Sample applications for Google Kubernetes Engine (GKE)☆1,346Updated this week
- End-to-end modular samples and landing zones toolkit for Terraform on GCP.☆1,973Updated this week
- ☆16Jul 15, 2024Updated last year
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,168Updated this week
- ☆80Mar 20, 2026Updated last week
- ☆63Jan 20, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI☆16,437Mar 20, 2026Updated last week
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- A curated list of resources about all things Gemini in Google Cloud.☆79Jan 3, 2025Updated last year
- ☆35Mar 18, 2026Updated last week
- Simple Demos for Google ADK☆18Nov 11, 2025Updated 4 months ago
- Showcasing Google Cloud's generative AI for marketing scenarios via application frontend, backend, and detailed, step-by-step guidance fo…☆478Mar 20, 2026Updated last week
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆55Jan 21, 2026Updated 2 months ago