kserve / open-inference-protocol
Repository for open inference protocol specification
☆53 · Updated 9 months ago
Alternatives and similar repositories for open-inference-protocol:
Users who are interested in open-inference-protocol are comparing it to the repositories listed below.
- Controller for ModelMesh ☆228 · Updated last month
- User documentation for KServe. ☆106 · Updated this week
- Distributed Model Serving Framework ☆163 · Updated last month
- ☆117 · Updated this week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project) ☆209 · Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆92 · Updated this week
- Kubeflow Pipelines on Tekton ☆180 · Updated 5 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆219 · Updated last week
- GenAI inference performance benchmarking tool ☆39 · Updated 3 weeks ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆93 · Updated last week
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods ☆21 · Updated last month
- KServe models web UI ☆37 · Updated last week
- ☆143 · Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆504 · Updated this week
- kfctl is a CLI for deploying and managing Kubeflow ☆184 · Updated last year
- K8s device plugin for GPU sharing ☆100 · Updated last year
- Holistic job manager on Kubernetes ☆115 · Updated last year
- A top-like tool for monitoring GPUs in a cluster ☆86 · Updated last year
- Gateway API Inference Extension ☆243 · Updated this week
- Helm charts for the KubeRay project ☆43 · Updated 2 weeks ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆402 · Updated last week
- MLFlow Deployment Plugin for Ray Serve ☆44 · Updated 3 years ago
- Argoflow has been superseded by deployKF ☆137 · Updated last year
- MLOps Python Library ☆118 · Updated 3 years ago
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project ☆88 · Updated last year
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆348 · Updated this week
- Cloud Native Benchmarking of Foundation Models ☆30 · Updated 5 months ago
- Fybrik ☆132 · Updated last year
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆220 · Updated this week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆204 · Updated last year