kubeflow / sdkLinks
Universal Python SDK to run AI workloads on Kubernetes
☆72Updated this week
Alternatives and similar repositories for sdk
Users that are interested in sdk are comparing it to the libraries listed below
Sorting:
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆161Updated this week
- Kubeflow Notebooks runs interactive development environments for AI, ML, and Data workloads on Kubernetes.☆65Updated this week
- Controller for ModelMesh☆242Updated 8 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆146Updated this week
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆159Updated this week
- Repository for open inference protocol specification☆64Updated 8 months ago
- User documentation for KServe.☆109Updated last week
- Helm charts for the KubeRay project☆59Updated 2 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆304Updated last week
- Distributed Model Serving Framework☆185Updated 4 months ago
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆222Updated last month
- Kubernetes AI Conformance☆157Updated last week
- GenAI inference performance benchmarking tool☆142Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,111Updated this week
- Helm charts for llm-d☆52Updated 6 months ago
- KServe models web UI☆47Updated this week
- Kubeflow Pipelines on Tekton☆182Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆662Updated last week
- Gateway API Inference Extension☆576Updated this week
- Run Slurm in Kubernetes☆358Updated this week
- WG Serving☆34Updated last month
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆124Updated this week
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆365Updated this week
- ☆191Updated 3 weeks ago
- A toolkit for discovering cluster network topology.☆96Updated last week
- ☆280Updated this week
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- 🎉 An awesome & curated list of best LLMOps tools.☆190Updated this week
- NVIDIA DRA Driver for GPUs☆557Updated this week
- ☆44Updated last week