JobSet: a k8s native API for distributed ML training and HPC workloads
☆321Mar 19, 2026Updated last week
Alternatives and similar repositories for jobset
Users that are interested in jobset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆682Mar 21, 2026Updated last week
- Kubernetes-native Job Queueing☆2,399Updated this week
- Gateway API Inference Extension☆616Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- ☆48Dec 8, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆292Jan 26, 2026Updated 2 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆124Feb 23, 2026Updated last month
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆519Mar 19, 2026Updated last week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Holistic job manager on Kubernetes☆116Feb 20, 2024Updated 2 years ago
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,284Mar 19, 2026Updated last week
- An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.☆62Mar 20, 2026Updated last week
- data plane testing utility of cloud native☆222Feb 23, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- All the things to make the scheduler extendable with wasm.☆130Nov 24, 2025Updated 4 months ago
- AppWrapper controller for Kueue☆17Mar 20, 2026Updated last week
- ☆294Mar 9, 2026Updated 2 weeks ago
- High fidelity and scalable capacity and usage metrics for Kubernetes clusters☆132Mar 4, 2025Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 3 months ago
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 4 months ago
- KJob: Tool for CLI-loving ML researchers☆41Dec 29, 2025Updated 2 months ago
- NVIDIA DRA Driver for GPUs☆593Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,388Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository contains a Kubernetes controller that manages node taints based on multiple readiness conditions, providing fine-grained …☆120Mar 18, 2026Updated last week
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,063Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,191Updated this week
- Layer4 egress gateway for Kubernetes☆286Mar 16, 2026Updated last week
- Node Resource Interface☆371Mar 17, 2026Updated last week
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Mar 20, 2026Updated last week
- Federated middleware based on Karmada☆49Nov 20, 2023Updated 2 years ago
- Node feature discovery for Kubernetes☆1,008Mar 20, 2026Updated last week
- An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments☆186Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.☆3,072Updated this week
- d.run website☆16Mar 13, 2026Updated 2 weeks ago
- Kubernetes network policies reference implementation☆73Mar 17, 2026Updated last week
- ☆18Mar 11, 2026Updated 2 weeks ago
- A Cloud Native Batch System (Project under CNCF)☆5,395Mar 19, 2026Updated last week
- The Encyclopedia of Kubernetes clusters☆860Mar 4, 2026Updated 3 weeks ago
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year