kserve / open-inference-protocol
Repository for open inference protocol specification
☆45 Updated 5 months ago
Alternatives and similar repositories for open-inference-protocol:
Users who are interested in open-inference-protocol are comparing it to the repositories listed below.
- Controller for ModelMesh ☆213 Updated last week
- User documentation for KServe. ☆103 Updated this week
- Distributed Model Serving Framework ☆156 Updated 3 months ago
- ☆97 Updated this week
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project) ☆198 Updated last month
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆167 Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆168 Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆74 Updated this week
- Kubeflow Pipelines on Tekton ☆175 Updated last month
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆86 Updated this week
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods ☆21 Updated last month
- K8s device plugin for GPU sharing ☆99 Updated last year
- kfctl is a CLI for deploying and managing Kubeflow ☆182 Updated last year
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p… ☆91 Updated 9 months ago
- Holistic job manager on Kubernetes ☆110 Updated 10 months ago
- Argoflow has been superseded by deployKF ☆136 Updated last year
- ☆95 Updated 3 weeks ago
- MLOps Python Library ☆117 Updated 2 years ago
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆203 Updated last year
- KServe models web UI ☆35 Updated 4 months ago
- GenAI inference performance benchmarking tool ☆10 Updated this week
- A top-like tool for monitoring GPUs in a cluster ☆83 Updated 11 months ago
- Kubernetes release optimizer ☆256 Updated 5 months ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆451 Updated this week
- Fybrik ☆131 Updated last year
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆301 Updated this week
- MLFlow Deployment Plugin for Ray Serve ☆43 Updated 2 years ago
- Containerization and cloud native suite for OPEA ☆33 Updated this week