kserve / open-inference-protocol
Repository for open inference protocol specification
☆56 Updated 3 weeks ago
Alternatives and similar repositories for open-inference-protocol
Users who are interested in open-inference-protocol are comparing it to the repositories listed below
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆128 Updated this week
- Controller for ModelMesh ☆230 Updated 3 weeks ago
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods ☆21 Updated 3 weeks ago
- User documentation for KServe. ☆106 Updated last week
- Distributed Model Serving Framework ☆168 Updated 3 weeks ago
- Helm charts for llm-d ☆35 Updated last week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆98 Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆107 Updated this week
- Kubeflow Pipelines on Tekton ☆182 Updated 6 months ago
- GenAI inference performance benchmarking tool ☆45 Updated this week
- MLFlow Deployment Plugin for Ray Serve ☆45 Updated 3 years ago
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆229 Updated this week
- Gateway API Inference Extension ☆317 Updated this week
- KServe models web UI ☆38 Updated this week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project) ☆209 Updated last month
- Helm charts for the KubeRay project ☆43 Updated last month
- A top-like tool for monitoring GPUs in a cluster ☆85 Updated last year
- MLOps Python Library ☆119 Updated 3 years ago
- Chassis turns machine learning models into portable container images that can run just about anywhere. ☆86 Updated last year
- Fybrik ☆132 Updated last year
- Argoflow has been superseded by deployKF ☆137 Updated last year
- AppWrapper controller for Kueue ☆13 Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆27 Updated 6 months ago
- kfctl is a CLI for deploying and managing Kubeflow ☆184 Updated last year
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ☆79 Updated this week
- ☆215 Updated this week
- Holistic job manager on Kubernetes ☆115 Updated last year
- ☆34 Updated 2 weeks ago
- K8s device plugin for GPU sharing ☆98 Updated 2 years ago
- This repository contains example integrations between Determined and other ML products ☆48 Updated last year