JobSet: a k8s native API for distributed ML training and HPC workloads
☆326Jun 15, 2026Updated this week
Alternatives and similar repositories for jobset
Users that are interested in jobset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆741Updated this week
- Kubernetes-native Job Queueing☆2,554Updated this week
- Gateway API Inference Extension☆688Jun 9, 2026Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- ☆52Mar 25, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆305Jan 26, 2026Updated 4 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆131Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆528Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆13May 16, 2023Updated 3 years ago
- An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.☆244Updated this week
- Holistic job manager on Kubernetes☆117Feb 20, 2024Updated 2 years ago
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,296Updated this week
- data plane testing utility of cloud native☆222May 28, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- All the things to make the scheduler extendable with wasm.☆129Nov 24, 2025Updated 6 months ago
- AppWrapper controller for Kueue☆17May 22, 2026Updated 3 weeks ago
- ☆299Apr 16, 2026Updated last month
- High fidelity and scalable capacity and usage metrics for Kubernetes clusters☆135Mar 4, 2025Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 31, 2026Updated 2 weeks ago
- DRA Driver for NVIDIA GPUs☆654Updated this week
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 6 months ago
- KJob: Tool for CLI-loving ML researchers☆43Jun 1, 2026Updated 2 weeks ago
- A toolkit to run Ray applications on Kubernetes☆2,542Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,112Updated this week
- This repository contains a Kubernetes controller that manages node taints based on multiple readiness conditions, providing fine-grained …☆147Jun 5, 2026Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,306Updated this week
- Layer4 egress gateway for Kubernetes☆297May 29, 2026Updated 2 weeks ago
- Node Resource Interface☆389Jun 5, 2026Updated last week
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
- Federated middleware based on Karmada☆49Nov 20, 2023Updated 2 years ago
- Node feature discovery for Kubernetes☆1,045Jun 7, 2026Updated last week
- An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments☆208Jun 8, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.☆3,122Updated this week
- d.run website☆17Updated this week
- Kubernetes network policies reference implementation☆78Jun 7, 2026Updated last week
- ☆21Mar 11, 2026Updated 3 months ago
- A Cloud Native Batch System (Project under CNCF)☆5,671Updated this week
- The Encyclopedia of Kubernetes clusters☆872May 30, 2026Updated 2 weeks ago
- WG Serving☆37Mar 24, 2026Updated 2 months ago