JobSet: a k8s native API for distributed ML training and HPC workloads
☆323May 1, 2026Updated this week
Alternatives and similar repositories for jobset
Users that are interested in jobset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆712Updated this week
- Kubernetes-native Job Queueing☆2,486Updated this week
- Gateway API Inference Extension☆660Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- ☆50Mar 25, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆302Jan 26, 2026Updated 3 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆127Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆524Apr 14, 2026Updated 3 weeks ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Holistic job manager on Kubernetes☆117Feb 20, 2024Updated 2 years ago
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,290Apr 21, 2026Updated 2 weeks ago
- An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.☆211Updated this week
- data plane testing utility of cloud native☆222Apr 20, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- All the things to make the scheduler extendable with wasm.☆130Nov 24, 2025Updated 5 months ago
- AppWrapper controller for Kueue☆17Updated this week
- ☆297Apr 16, 2026Updated 2 weeks ago
- DRA Driver for NVIDIA GPUs☆637Updated this week
- High fidelity and scalable capacity and usage metrics for Kubernetes clusters☆135Mar 4, 2025Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 5 months ago
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 5 months ago
- KJob: Tool for CLI-loving ML researchers☆42Mar 31, 2026Updated last month
- A toolkit to run Ray applications on Kubernetes☆2,476Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,095Updated this week
- This repository contains a Kubernetes controller that manages node taints based on multiple readiness conditions, providing fine-grained …☆137Updated this week
- Layer4 egress gateway for Kubernetes☆290Apr 13, 2026Updated 3 weeks ago
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,255Updated this week
- Node Resource Interface☆377Apr 28, 2026Updated last week
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
- Federated middleware based on Karmada☆49Nov 20, 2023Updated 2 years ago
- Node feature discovery for Kubernetes☆1,026Apr 28, 2026Updated last week
- An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments☆196Apr 28, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.☆3,106Apr 27, 2026Updated last week
- d.run website☆17Apr 20, 2026Updated 2 weeks ago
- Kubernetes network policies reference implementation☆76Apr 27, 2026Updated last week
- ☆20Mar 11, 2026Updated last month
- A Cloud Native Batch System (Project under CNCF)☆5,530Updated this week
- The Encyclopedia of Kubernetes clusters☆870Updated this week
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year