envoyproxy / ai-gatewayLinks
Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.
☆340Updated this week
Alternatives and similar repositories for ai-gateway
Users that are interested in ai-gateway are comparing it to the libraries listed below
Sorting:
- Gateway API Inference Extension☆379Updated this week
- ☆159Updated 3 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆242Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆219Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆514Updated this week
- Next Generation Agentic Proxy for AI Agents and MCP servers☆232Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆682Updated this week
- All the things to make the scheduler extendable with wasm.☆125Updated 3 weeks ago
- eBPF Collector☆359Updated this week
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫☆196Updated this week
- Spin Operator is a Kubernetes operator that empowers platform engineers to deploy Spin applications as custom resources to their Kubernet…☆260Updated this week
- The `ztunnel` component of ambient mesh☆387Updated this week
- Contains miscellaneous Wasm extensions for Istio☆120Updated 3 months ago
- NVIDIA DRA Driver for GPUs☆392Updated this week
- Prow is a Kubernetes based CI/CD system developed to serve the Kubernetes community. This repository contains Prow source code and Hugo s…☆204Updated last week
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co…☆240Updated 3 weeks ago
- Model Context Protocol (MCP) server for Kubernetes and OpenShift☆374Updated this week
- Declarative Workflow of KubeVela which can run as standalone.☆122Updated 2 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆77Updated 2 weeks ago
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- GenAI inference performance benchmarking tool☆66Updated this week
- Automatic SRE Superpowers within your Kubernetes cluster☆377Updated this week
- Node Resource Interface☆312Updated last week
- ☆124Updated last week
- A Go framework for end-to-end testing of components running in Kubernetes clusters.☆582Updated 2 weeks ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,016Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆67Updated 2 months ago
- Kubernetes AI Toolchain Operator☆663Updated this week
- ☆404Updated this week
- Experiment for Multi cluster controllers with controller-runtime☆177Updated last week