kserve / modelmesh-runtime-adapter
Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods
☆21Updated last month
Related projects ⓘ
Alternatives and complementary repositories for modelmesh-runtime-adapter
- Distributed Model Serving Framework☆154Updated 3 weeks ago
- Controller for ModelMesh☆203Updated 3 months ago
- ☆84Updated this week
- User documentation for KServe.☆105Updated this week
- Repository for open inference protocol specification☆42Updated 3 months ago
- Kubeflow Pipelines on Tekton☆173Updated last week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes☆25Updated this week
- Holistic job manager on Kubernetes☆108Updated 8 months ago
- A repository for Open Data Hub Kustomize manifests extending upstream Kubeflow manifests☆62Updated last year
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆74Updated 2 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆143Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆50Updated this week
- ☆19Updated this week
- LLM Instance gateway implementation.☆64Updated this week
- MCAD v2☆10Updated 6 months ago
- KAR: A Runtime for the Hybrid Cloud☆28Updated last month
- WG Serving☆12Updated last week
- Seldon Core Operator for Kubernetes☆12Updated 5 years ago
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆194Updated 3 months ago
- ModelMesh Performance Scripts, Dashboard and Pipelines☆11Updated 2 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆130Updated this week
- Journey to open platform for digital bank modernization. A reference implementation of BIAN to start, providing documentation and artifac…☆15Updated 2 years ago
- Fybrik☆131Updated last year
- Test infrastructure and tooling for Kubeflow.☆63Updated 3 months ago
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆19Updated 3 weeks ago
- Open Data Hub operator to manage ODH component integrations☆60Updated this week
- ☆50Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆23Updated 3 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆57Updated last month