NVIDIA / nvkind
☆186 Updated last month
Alternatives and similar repositories for nvkind
Users that are interested in nvkind are comparing it to the libraries listed below
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆292 Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆111 Updated 2 months ago
- GenAI inference performance benchmarking tool ☆138 Updated last week
- K8s device plugin for GPU sharing ☆99 Updated 2 years ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆140 Updated 2 weeks ago
- llm-d helm charts and deployment examples ☆48 Updated 3 weeks ago
- WG Serving ☆32 Updated 2 weeks ago
- NVIDIA DRA Driver for GPUs ☆523 Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆642 Updated last week
- Cloud Native Artificial Intelligence Model Format Specification ☆163 Updated this week
- ☆62 Updated last year
- ☆183 Updated this week
- GPU plugin to the node feature discovery for Kubernetes ☆309 Updated last year
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆158 Updated last week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. ☆87 Updated last month
- KJob: Tool for CLI-loving ML researchers ☆40 Updated this week
- This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service co… ☆251 Updated last week
- Kubernetes Work API ☆69 Updated last month
- Gateway API Inference Extension ☆554 Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆73 Updated 5 months ago
- All the things to make the scheduler extendable with wasm. ☆129 Updated last month
- Helm charts for llm-d ☆50 Updated 5 months ago
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project ☆89 Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community. ☆13 Updated 3 weeks ago
- Holistic job manager on Kubernetes ☆115 Updated last year
- Library for multi-cluster controllers with controller-runtime ☆241 Updated 2 weeks ago
- A toolkit for building declarative operators with kubebuilder ☆262 Updated 7 months ago
- Simplified model deployment on llm-d ☆28 Updated 6 months ago
- NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated compu… ☆142 Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆48 Updated last week