kubernetes-sigs / kjob
KJob: Tool for CLI-loving ML researchers
☆20 Updated this week
Alternatives and similar repositories for kjob:
Users who are interested in kjob are comparing it to the repositories listed below
- Container Object Storage Interface (COSI) provisioner responsible to interface with COSI drivers. NOTE: The content of this repo has bee… ☆35 Updated 2 months ago
- Operator for managing Node Feature Discovery deployment ☆68 Updated 2 months ago
- Manage Kubernetes node-level kernel tuning (using sysctl). ☆27 Updated last year
- ☆37 Updated this week
- A CRD for arbitrary properties about a cluster ☆34 Updated 2 weeks ago
- Operator that deploys additional KubeVirt resources ☆32 Updated this week
- hub / spoke registration controllers ☆42 Updated 4 months ago
- Container Object Storage (COSI) Specification. NOTE: The content of this repo has been moved to https://github.com/kubernetes-sigs/contai… ☆64 Updated 2 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆57 Updated this week
- Low level generic controller framework ☆53 Updated this week
- Behaviour driven reconciler testing framework. ☆30 Updated this week
- ☆12 Updated last year
- Automatic repair for unhealthy Kubernetes nodes ☆50 Updated 3 weeks ago
- Storage operator for Kubernetes ☆43 Updated 3 weeks ago
- Container Object Storage Interface (COSI) API responsible to define API for COSI objects. NOTE: The content of this repo has been moved t… ☆70 Updated 2 months ago
- Batch scheduler based on the K8s scheduling framework; related features have been contributed to scheduler-plugins (deprecated). ☆25 Updated 4 years ago
- Container Object Storage Interface (COSI) controller responsible to manage lifecycle of COSI objects. NOTE: The content of this repo has … ☆95 Updated 2 months ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. ☆26 Updated last month
- A tool to help you migrate Kubernetes CustomResourceDefinition data across API groups and namespaces ☆46 Updated 3 years ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i… ☆12 Updated last year
- Holistic job manager on Kubernetes ☆111 Updated last year
- Kubernetes Work API ☆61 Updated this week
- ☆52 Updated this week
- ☆52 Updated this week
- ☆34 Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆187 Updated this week
- This repo contains sidecar controller and agent for volume health monitoring. ☆67 Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆61 Updated 3 weeks ago
- Cluster-wide IPAM CNI plugin ☆35 Updated this week