AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
☆328Jun 23, 2025Updated 11 months ago
Alternatives and similar repositories for ai-on-gke
Users that are interested in ai-on-gke are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆49May 5, 2026Updated last month
- ☆12Jun 11, 2024Updated 2 years ago
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆133Jun 8, 2026Updated last week
- GenAI inference performance benchmarking tool☆198Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,542Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Gateway API Inference Extension☆693Updated this week
- Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine☆16Apr 28, 2025Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆741Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆346Updated this week
- An end-to-end operating model for onboarding and continually deploying services with Anthos.☆85May 3, 2024Updated 2 years ago
- ☆42Updated this week
- ☆36Updated this week
- Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative…☆743Jun 11, 2026Updated last week
- Kubernetes-native Job Queueing☆2,572Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Mono repo for open-sourcing Cloud Solutions Architects projects☆122Updated this week
- Finetune LLMs on K8s by using Runbooks☆169Aug 28, 2024Updated last year
- ☆23Mar 11, 2026Updated 3 months ago
- ☆328Mar 25, 2025Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆445Jan 5, 2026Updated 5 months ago
- AppWrapper controller for Kueue☆17May 22, 2026Updated 3 weeks ago
- ☆21Mar 11, 2026Updated 3 months ago
- ☆15Apr 15, 2025Updated last year
- This repository compiles code samples and notebooks demonstrating how to use Generative AI on Google Cloud Vertex AI.☆837Jan 6, 2026Updated 5 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- End-to-end modular samples and landing zones toolkit for Terraform on GCP.☆2,035Jun 12, 2026Updated last week
- llm-d benchmark scripts and tooling☆63Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,210Jun 10, 2026Updated last week
- Packaged configuration for setting up a Kubernetes cluster with Anthos Service Mesh features enabled☆143Updated this week
- ☆15Jan 26, 2021Updated 5 years ago
- Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform☆17,032Updated this week
- ☆47Mar 25, 2023Updated 3 years ago
- ☆106Jun 11, 2026Updated last week
- A curated list of resources about all things Gemini in Google Cloud.☆79Jan 3, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆35Mar 18, 2026Updated 3 months ago
- Simple Demos for Google ADK☆19Nov 11, 2025Updated 7 months ago
- A direct Google Cloud Storage integration for PyTorch☆46Mar 25, 2026Updated 2 months ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 10 months ago
- Empowering LLM Agents for Real-World Computer System Optimization☆18Sep 10, 2025Updated 9 months ago
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- Summarizes document using OCR and Vertex Generative AI LLM☆157Apr 14, 2026Updated 2 months ago