kubernetes-sigs / kjobLinks
KJob: Tool for CLI-loving ML researchers
☆39Updated last week
Alternatives and similar repositories for kjob
Users that are interested in kjob are comparing it to the libraries listed below
Sorting:
- Example DRA driver that developers can fork and modify to get them started writing their own.☆107Updated 3 weeks ago
- Operator for managing Node Feature Discovery deployment☆73Updated 3 months ago
- K8s device plugin for GPU sharing☆99Updated 2 years ago
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆49Updated last week
- ☆180Updated this week
- Operator that deploys additional KubeVirt resources☆38Updated this week
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t…☆70Updated 11 months ago
- WG Serving☆31Updated last month
- ☆52Updated last week
- CNI DRA Driver☆31Updated last month
- ☆40Updated 2 weeks ago
- Prototypes and experiments for WG Device Management.☆12Updated 2 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆281Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆136Updated this week
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆146Updated this week
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai…☆63Updated 11 months ago
- ☆38Updated this week
- Automatic repair for unhealthy Kubernetes nodes☆57Updated this week
- Holistic job manager on Kubernetes☆116Updated last year
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated 11 months ago
- Container Object Storage Interface (COSI) controller responsible to manage lifecycle of COSI objects. NOTE: The content of this repo has …☆93Updated 11 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆47Updated this week
- Behaviour driven reconciler testing framework.☆30Updated this week
- AppWrapper controller for Kueue☆16Updated 2 weeks ago
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆94Updated 3 weeks ago
- API for coordinating Maintenance in Kubernetes.☆26Updated 4 months ago
- Simplified model deployment on llm-d☆27Updated 4 months ago
- Container Object Storage Interface (COSI) provisioner responsible to interface with COSI drivers. NOTE: The content of this repo has bee…☆34Updated 11 months ago
- Installs and maintains the kube-scheduler on a cluster.☆36Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆71Updated 4 months ago