envoyproxy / ai-gateway
Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.
☆220Updated this week
Alternatives and similar repositories for ai-gateway:
Users that are interested in ai-gateway are comparing it to the libraries listed below
- Gateway API Inference Extension☆229Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆218Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆402Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆123Updated this week
- ☆126Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆69Updated last month
- GenAI inference performance benchmarking tool☆36Updated 2 weeks ago
- All the things to make the scheduler extendable with wasm.☆116Updated 3 months ago
- A federation scheduler for multi-cluster☆36Updated 2 months ago
- Contains miscellaneous Wasm extensions for Istio☆118Updated 2 weeks ago
- Generates Kubernetes CRD API reference documentation