kserve / open-inference-protocol
Repository for open inference protocol specification
☆46Updated 6 months ago
Alternatives and similar repositories for open-inference-protocol:
Users that are interested in open-inference-protocol are comparing it to the libraries listed below
- Controller for ModelMesh☆217Updated last month
- User documentation for KServe.☆104Updated this week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆202Updated 2 months ago
- ☆104Updated this week
- Distributed Model Serving Framework☆159Updated 4 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆89Updated this week
- Kubeflow Pipelines on Tekton☆176Updated 2 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆186Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆82Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆231Updated this week
- Holistic job manager on Kubernetes☆111Updated 11 months ago
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆21Updated 3 weeks ago
- ☆101Updated this week
- K8s device plugin for GPU sharing☆99Updated last year
- Argoflow has been superseded by deployKF☆136Updated last year
- GenAI inference performance benchmarking tool☆16Updated this week
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆91Updated 10 months ago
- Kubernetes release optimizer☆256Updated 6 months ago
- kfctl is a CLI for deploying and managing Kubeflow☆183Updated last year
- A top-like tool for monitoring GPUs in a cluster☆84Updated last year
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆312Updated this week
- ☆49Updated 11 months ago
- KServe models web UI☆36Updated this week
- Seldon Core Operator for Kubernetes☆12Updated 5 years ago
- Helm charts for the KubeRay project☆38Updated this week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆203Updated last year
- MLFlow Deployment Plugin for Ray Serve☆43Updated 2 years ago
- ☆27Updated this week
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project☆88Updated 10 months ago
- Fybrik☆132Updated last year