AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
☆327Jun 23, 2025Updated 10 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 11, 2024Updated last year
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆94May 1, 2026Updated last week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆131May 2, 2026Updated last week
- GenAI inference performance benchmarking tool☆180May 1, 2026Updated last week
- ☆15Apr 17, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jul 9, 2024Updated last year
- A toolkit to run Ray applications on Kubernetes☆2,485Updated this week
- WG Serving☆35Mar 24, 2026Updated last month
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆715Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆337Apr 30, 2026Updated last week
- An end-to-end operating model for onboarding and continually deploying services with Anthos.☆85May 3, 2024Updated 2 years ago
- ☆35Apr 27, 2026Updated last week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative…☆725Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Create a secure ML environment on Vertex AI☆37Apr 24, 2026Updated 2 weeks ago
- Kubernetes-native Job Queueing☆2,486Updated this week
- Mono repo for open-sourcing Cloud Solutions Architects projects☆118Updated this week
- Finetune LLMs on K8s by using Runbooks☆170Aug 28, 2024Updated last year
- ☆327Mar 25, 2025Updated last year
- ☆26Jan 26, 2026Updated 3 months ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆251Apr 27, 2026Updated last week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆432Jan 5, 2026Updated 4 months ago
- AppWrapper controller for Kueue☆17Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆20Mar 11, 2026Updated last month
- ☆15Apr 15, 2025Updated last year
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆832Jan 6, 2026Updated 4 months ago
- End-to-end modular samples and landing zones toolkit for Terraform on GCP.☆2,002Updated this week
- llm-d benchmark scripts and tooling☆58May 1, 2026Updated last week
- ☆16Jul 15, 2024Updated last year
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,196Mar 31, 2026Updated last month
- ☆63Mar 25, 2026Updated last month
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform☆16,780May 1, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆96Updated this week
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆77May 1, 2024Updated 2 years ago
- A curated list of resources about all things Gemini in Google Cloud.☆79Jan 3, 2025Updated last year
- ☆35Mar 18, 2026Updated last month
- Simple Demos for Google ADK☆18Nov 11, 2025Updated 5 months ago
- Showcasing Google Cloud's generative AI for marketing scenarios via application frontend, backend, and detailed, step-by-step guidance fo…☆491Apr 17, 2026Updated 3 weeks ago
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆56Jan 21, 2026Updated 3 months ago