KJob: Tool for CLI-loving ML researchers
☆40Dec 29, 2025Updated 2 months ago
Alternatives and similar repositories for kjob
Users that are interested in kjob are comparing it to the libraries listed below
Sorting:
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- Simplified Data Management and Sharing for Kubernetes☆17Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Prototypes and experiments for WG Device Management.☆15Feb 11, 2026Updated 3 weeks ago
- CPU DRA Driver☆32Feb 27, 2026Updated last week
- Experimental DRA driver bringing CNI closer to Kubernetes☆39Oct 1, 2025Updated 5 months ago
- Running and managing Wasm(actors) and capability providers in Kubernetes☆31Dec 12, 2023Updated 2 years ago
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
- Deploy a Flux MiniCluster to Kubernetes with the operator☆40Jan 9, 2026Updated 2 months ago
- [Moved to https://github.com/kubernetes-sigs/kwok] fake-k8s is a tool for running Fake Kubernetes clusters, It can be used as an alternat…☆19Jan 6, 2023Updated 3 years ago
- A Rust port of BuntDB☆15Aug 5, 2021Updated 4 years ago
- ☆16Apr 22, 2025Updated 10 months ago
- OpenAPI Golang client library for Slurm REST API. A Slinky project.☆23Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆63Updated this week
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- Backend server for envd☆22Dec 18, 2023Updated 2 years ago
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆59Oct 17, 2024Updated last year
- JobSet: a k8s native API for distributed ML training and HPC workloads☆317Mar 2, 2026Updated last week
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆27Jan 9, 2026Updated last month
- Go client for Casbin-Server☆27Jul 20, 2025Updated 7 months ago
- 🧬 The adaptive model routing system for exploration and exploitation.☆22Jan 4, 2026Updated 2 months ago
- Easily share pprof formatted profiles from your terminal.☆33Oct 12, 2022Updated 3 years ago
- ☆34Mar 1, 2026Updated last week
- Kubernetes 源码学习笔记 🔭☆23Apr 5, 2022Updated 3 years ago
- Operator to manage cloud Ingress/Load balancer scheme, switching between public/private modes☆33Updated this week
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- Framework and scripts to create multiple Kubernetes clusters with kind (K8s in Docker) for local E2E testing and development.☆72Updated this week
- A tool for coordinated checkpoint/restore of distributed applications with CRIU☆31Updated this week
- Contains the API definitions used by OLM and Marketplace☆34Updated this week
- KCL Example Repository☆33Apr 8, 2025Updated 11 months ago
- WG Serving☆34Dec 15, 2025Updated 2 months ago
- Coming soon. Do not import.☆27Feb 13, 2026Updated 3 weeks ago
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆150Mar 1, 2026Updated last week
- Documentation repository for NVIDIA Cloud Native Technologies☆37Mar 2, 2026Updated last week
- Kubernetes operator for bpfman☆33Jan 26, 2026Updated last month
- The kernel module management operator builds, signs and loads kernel modules on OpenShift.☆32Updated this week
- A toolkit for discovering cluster network topology.☆102Updated this week
- Operator for managing Node Feature Discovery deployment☆74Jan 29, 2026Updated last month