AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
☆327Jun 23, 2025Updated 9 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆87Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆130Updated this week
- GenAI inference performance benchmarking tool☆166Apr 10, 2026Updated last week
- Gateway API Inference Extension☆639Apr 10, 2026Updated last week
- A toolkit to run Ray applications on Kubernetes☆2,448Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- WG Serving☆34Mar 24, 2026Updated 3 weeks ago
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated 11 months ago
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆334Updated this week
- An end-to-end operating model for onboarding and continually deploying services with Anthos.☆85May 3, 2024Updated last year
- ☆35Feb 4, 2026Updated 2 months ago
- Create a secure ML environment on Vertex AI☆38Updated this week
- Mono repo for open-sourcing Cloud Solutions Architects projects☆115Updated this week
- ☆23Mar 11, 2026Updated last month
- ☆327Mar 25, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆26Jan 26, 2026Updated 2 months ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆251Apr 9, 2026Updated last week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆424Jan 5, 2026Updated 3 months ago
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆830Jan 6, 2026Updated 3 months ago
- llm-d benchmark scripts and tooling☆55Apr 11, 2026Updated last week
- Sample applications for Google Kubernetes Engine (GKE)☆1,346Updated this week
- End-to-end modular samples and landing zones toolkit for Terraform on GCP.☆1,988Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,186Mar 31, 2026Updated 2 weeks ago
- ☆64Mar 25, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Packaged configuration for setting up a Kubernetes cluster with Anthos Service Mesh features enabled☆143Updated this week
- ☆90Updated this week
- ☆15Jan 26, 2021Updated 5 years ago
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI☆16,607Apr 10, 2026Updated last week
- ☆47Mar 25, 2023Updated 3 years ago
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆78May 1, 2024Updated last year
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- A curated list of resources about all things Gemini in Google Cloud.☆79Jan 3, 2025Updated last year
- ☆35Mar 18, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Showcasing Google Cloud's generative AI for marketing scenarios via application frontend, backend, and detailed, step-by-step guidance fo…☆486Updated this week
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆56Jan 21, 2026Updated 2 months ago
- A direct Google Cloud Storage integration for PyTorch☆46Mar 25, 2026Updated 3 weeks ago
- Empowering LLM Agents for Real-World Computer System Optimization☆17Sep 10, 2025Updated 7 months ago
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- Summarizes document using OCR and Vertex Generative AI LLM☆157Updated this week
- Open Source examples using Google Cloud to solve various Scientific and Technical Computing problems.☆24Apr 6, 2026Updated last week