JobSet: a k8s native API for distributed ML training and HPC workloads
☆324May 23, 2026Updated this week
Alternatives and similar repositories for jobset
Users that are interested in jobset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆727Updated this week
- Kubernetes-native Job Queueing☆2,505May 19, 2026Updated last week
- Gateway API Inference Extension☆675May 19, 2026Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- ☆52Mar 25, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆305Jan 26, 2026Updated 4 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆130Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆528May 18, 2026Updated last week
- An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.☆217Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 3 years ago
- Holistic job manager on Kubernetes☆117Feb 20, 2024Updated 2 years ago
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,293May 11, 2026Updated 2 weeks ago
- data plane testing utility of cloud native☆222May 19, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- All the things to make the scheduler extendable with wasm.☆129Nov 24, 2025Updated 6 months ago
- AppWrapper controller for Kueue☆17May 15, 2026Updated last week
- ☆299Apr 16, 2026Updated last month
- DRA Driver for NVIDIA GPUs☆645May 20, 2026Updated last week
- High fidelity and scalable capacity and usage metrics for Kubernetes clusters☆135Mar 4, 2025Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 7, 2026Updated 2 weeks ago
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 6 months ago
- KJob: Tool for CLI-loving ML researchers☆43Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,508Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,105Updated this week
- This repository contains a Kubernetes controller that manages node taints based on multiple readiness conditions, providing fine-grained …☆141Updated this week
- Layer4 egress gateway for Kubernetes☆295May 11, 2026Updated 2 weeks ago
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,279Updated this week
- Node Resource Interface☆382Apr 28, 2026Updated 3 weeks ago
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
- Federated middleware based on Karmada☆49Nov 20, 2023Updated 2 years ago
- Node feature discovery for Kubernetes☆1,037Updated this week
- An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments☆203May 19, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.☆3,111May 19, 2026Updated last week
- d.run website☆17May 18, 2026Updated last week
- Kubernetes network policies reference implementation☆77May 16, 2026Updated last week
- ☆20Mar 11, 2026Updated 2 months ago
- A Cloud Native Batch System (Project under CNCF)☆5,594Updated this week
- The Encyclopedia of Kubernetes clusters☆870May 13, 2026Updated 2 weeks ago
- WG Serving☆37Mar 24, 2026Updated 2 months ago