envoyproxy / ai-gateway
Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.
☆275 Updated this week
Alternatives and similar repositories for ai-gateway
Users who are interested in ai-gateway are comparing it to the repositories listed below.
- Gateway API Inference Extension ☆304 Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆457 Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆229 Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆178 Updated last week
- Next Generation Agentic Proxy ☆183 Updated this week
- Spin Operator is a Kubernetes operator that empowers platform engineers to deploy Spin applications as custom resources to their Kubernet… ☆252 Updated last week
- ☆152 Updated this week
- All the things to make the scheduler extendable with wasm. ☆120 Updated last week
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co… ☆235 Updated 3 weeks ago
- Experiment for Multi cluster controllers with controller-runtime ☆154 Updated 3 weeks ago
- Contains miscellaneous Wasm extensions for Istio ☆118 Updated 2 months ago
- GenAI inference performance benchmarking tool ☆44 Updated this week
- Declarative Workflow of KubeVela which can run as standalone. ☆120 Updated last month
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆590 Updated this week
- Cluster API Provider for Nested Clusters ☆303 Updated 8 months ago
- A Go framework for end-to-end testing of components running in Kubernetes clusters. ☆579 Updated last week
- The `ztunnel` component of ambient mesh ☆379 Updated this week
- This Kubernetes Operator installs WebAssembly support on your Kubernetes Nodes ☆230 Updated last week
- Kubernetes Work API ☆66 Updated 3 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆73 Updated last week
- Prow is a Kubernetes based CI/CD system developed to serve the Kubernetes community. This repository contains Prow source code and Hugo s… ☆192 Updated last week
- eBPF Collector ☆345 Updated this week
- ☆397 Updated this week
- A toolkit for discovering cluster network topology. ☆53 Updated this week
- K8s-mcp-server is a Model Context Protocol (MCP) server that enables AI assistants like Claude to securely execute Kubernetes commands. I… ☆136 Updated last month
- Smart Kubernetes Scheduling ☆80 Updated this week
- [EOL] Reworking kube-proxy's architecture ☆246 Updated 10 months ago
- Multi-cluster api gateway based on apiserver-aggregation. ☆102 Updated 2 weeks ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆363 Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te… ☆970 Updated last week