kserve / open-inference-protocol
Repository for open inference protocol specification
☆55Updated this week
Alternatives and similar repositories for open-inference-protocol
Users that are interested in open-inference-protocol are comparing it to the libraries listed below
Sorting:
- User documentation for KServe.☆106Updated last week
- Controller for ModelMesh☆229Updated last week
- Distributed Model Serving Framework☆165Updated last week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆209Updated 2 weeks ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆125Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆96Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆103Updated this week
- GenAI inference performance benchmarking tool☆41Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆226Updated last week
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆21Updated last week
- Gateway API Inference Extension☆272Updated this week
- KServe models web UI☆38Updated this week
- Kubeflow Pipelines on Tekton☆180Updated 5 months ago
- kfctl is a CLI for deploying and managing Kubeflow☆184Updated last year
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆92Updated last year
- K8s device plugin for GPU sharing☆100Updated 2 years ago
- ☆150Updated 3 weeks ago
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆549Updated this week
- Helm charts for the KubeRay project☆43Updated last month
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆428Updated 2 weeks ago
- MLFlow Deployment Plugin for Ray Serve☆44Updated 3 years ago
- A toolkit for discovering cluster network topology.☆46Updated 2 weeks ago
- AppWrapper controller for Kueue☆13Updated 2 weeks ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆355Updated this week
- MLOps Python Library☆119Updated 3 years ago
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆246Updated this week
- Holistic job manager on Kubernetes☆115Updated last year
- Fybrik☆132Updated last year
- Chassis turns machine learning models into portable container images that can run just about anywhere.☆86Updated last year
- Argoflow has been superseded by deployKF☆137Updated last year