kubernetes-sigs / kjobLinks
KJob: Tool for CLI-loving ML researchers
☆37Updated this week
Alternatives and similar repositories for kjob
Users that are interested in kjob are comparing it to the libraries listed below
Sorting:
- Operator for managing Node Feature Discovery deployment☆71Updated 2 months ago
- ☆163Updated 2 weeks ago
- ☆38Updated last week
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- Distributed KV cache coordinator☆46Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆85Updated last week
- DraNet is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆100Updated this week
- ☆39Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆29Updated 8 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆120Updated this week
- Automatic repair for unhealthy Kubernetes nodes☆53Updated 3 weeks ago
- Holistic job manager on Kubernetes☆117Updated last year
- API for coordinating Maintenance in Kubernetes.☆26Updated 3 weeks ago
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t…☆70Updated 8 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆69Updated 3 weeks ago
- ☆52Updated 3 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆249Updated this week
- Behaviour driven reconciler testing framework.☆30Updated last week
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai…☆63Updated 8 months ago
- CNI DRA Driver☆27Updated 6 months ago
- Prototypes and experiments for WG Device Management.☆11Updated 8 months ago
- Operator that deploys additional KubeVirt resources☆34Updated this week
- GenAI inference performance benchmarking tool☆71Updated this week
- Container Object Storage Interface (COSI) controller responsible to manage lifecycle of COSI objects. NOTE: The content of this repo has …☆94Updated 8 months ago
- All the things to make the scheduler extendable with wasm.☆129Updated last month
- InterLink aims to provide an abstraction for the execution of a Kubernetes pod on any remote resource capable of managing a Container exe…☆84Updated this week
- An etcd operator to configure, provision, reconcile and monitor etcd clusters.☆91Updated this week
- Provides a general service to support image acceleration based on kinds of accelerator like Nydus and eStargz etc.☆92Updated last month
- IP Over Infiniband (IPoIB) CNI Plugin☆15Updated last week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆32Updated last week