kserve / open-inference-protocolLinks
Repository for open inference protocol specification
☆56Updated last month
Alternatives and similar repositories for open-inference-protocol
Users that are interested in open-inference-protocol are comparing it to the libraries listed below
Sorting:
- Distributed Model Serving Framework☆173Updated 3 weeks ago
- Controller for ModelMesh☆232Updated 2 weeks ago
- User documentation for KServe.☆106Updated this week
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆21Updated 2 weeks ago
- GenAI inference performance benchmarking tool☆58Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆114Updated this week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆130Updated this week
- Helm charts for llm-d☆43Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆98Updated last week
- Kubeflow Pipelines on Tekton☆182Updated 7 months ago
- Gateway API Inference Extension☆351Updated this week
- MLFlow Deployment Plugin for Ray Serve☆45Updated 3 years ago
- KServe models web UI☆38Updated last week
- MLOps Python Library☆119Updated 3 years ago
- Helm charts for the KubeRay project☆44Updated this week
- ☆157Updated last week
- Kubeflow SDK for ML Experience☆14Updated this week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆212Updated last month
- Argoflow has been superseded by deployKF☆137Updated last year
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆93Updated last year
- A top-like tool for monitoring GPUs in a cluster☆84Updated last year
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆28Updated 6 months ago
- Seldon Core Operator for Kubernetes☆12Updated 5 years ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆237Updated this week
- ☆37Updated this week
- Chassis turns machine learning models into portable container images that can run just about anywhere.☆86Updated last year
- AppWrapper controller for Kueue☆15Updated 2 weeks ago
- KServe V2 Protocol Rest API Implementation Proxy☆11Updated 2 weeks ago
- A toolkit for discovering cluster network topology.☆54Updated this week