AI on GKE is a collection of examples, best practices, and prebuilt solutions to help build, deploy, and scale AI platforms on Google Kubernetes Engine.
☆328Jun 23, 2025Updated 11 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below.
- ☆49May 5, 2026Updated 3 weeks ago
- ☆12Jun 11, 2024Updated last year
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆99May 22, 2026Updated last week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆133May 22, 2026Updated last week
- GenAI inference performance benchmarking tool☆190May 22, 2026Updated last week
- ☆15May 20, 2026Updated last week
- ☆11Jul 9, 2024Updated last year
- A toolkit to run Ray applications on Kubernetes☆2,508May 22, 2026Updated last week
- Gateway API Inference Extension☆675May 19, 2026Updated last week
- WG Serving☆37Mar 24, 2026Updated 2 months ago
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆727May 21, 2026Updated last week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆340May 22, 2026Updated last week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative…☆734May 19, 2026Updated last week
- Create a secure ML environment on Vertex AI☆37May 22, 2026Updated last week
- Kubernetes-native Job Queueing☆2,524Updated this week
- Mono repo for open-sourcing Cloud Solutions Architects projects☆118May 21, 2026Updated last week
- ☆326Mar 25, 2025Updated last year
- ☆26Jan 26, 2026Updated 4 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆439Jan 5, 2026Updated 4 months ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆254May 20, 2026Updated last week
- AppWrapper controller for Kueue☆17May 22, 2026Updated last week
- Sample applications for Google Kubernetes Engine (GKE)☆1,350Updated this week
- llm-d benchmark scripts and tooling☆60May 22, 2026Updated last week
- ☆11Mar 16, 2026Updated 2 months ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,201Mar 31, 2026Updated last month
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform☆16,920May 22, 2026Updated last week
- ☆47Mar 25, 2023Updated 3 years ago
- ☆101May 14, 2026Updated 2 weeks ago
- A curated list of resources about all things Gemini in Google Cloud.☆79Jan 3, 2025Updated last year
- Simple Demos for Google ADK☆19Nov 11, 2025Updated 6 months ago
- Showcasing Google Cloud's generative AI for marketing scenarios via application frontend, backend, and detailed, step-by-step guidance fo…☆497Updated this week
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆56Jan 21, 2026Updated 4 months ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 9 months ago
- Empowering LLM Agents for Real-World Computer System Optimization☆18Sep 10, 2025Updated 8 months ago
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- GKE Autopilot examples including using compute classes, GPU, workload separation, and more.☆11Sep 15, 2023Updated 2 years ago
- ☆37Apr 22, 2026Updated last month
- A high level scripting API for bot builders, developers, and maintainers.☆136Mar 25, 2026Updated 2 months ago