NVIDIA / cloud-native-stack
Run cloud native workloads on NVIDIA GPUs
☆190 · Updated 3 weeks ago
Alternatives and similar repositories for cloud-native-stack
Users who are interested in cloud-native-stack are comparing it to the repositories listed below.
- MIG Partition Editor for NVIDIA GPUs ☆209 · Updated this week
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆126 · Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆124 · Updated this week
- GPU plugin to the node feature discovery for Kubernetes ☆303 · Updated last year
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs ☆561 · Updated this week
- NVIDIA Network Operator ☆270 · Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆491 · Updated last week
- Kubernetes Operator, Ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆107 · Updated last week
- Share GPU between Pods in Kubernetes ☆211 · Updated 2 years ago
- NVIDIA DRA Driver for GPUs ☆413 · Updated this week
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster ☆337 · Updated 2 weeks ago
- A Slurm cluster for Kubernetes ☆62 · Updated last year
- ☆26 · Updated 2 weeks ago
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆88 · Updated 3 years ago
- NVIDIA NCCL Tests for Distributed Training ☆105 · Updated this week
- markdown docs ☆90 · Updated this week
- ☆256 · Updated last week
- Controller for ModelMesh ☆239 · Updated 2 months ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine ☆237 · Updated last week
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes ☆140 · Updated this week
- ☆64 · Updated last week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆141 · Updated this week
- This repo includes everything you need to know about deploying GPU nodes on OCI ☆34 · Updated this week
- ☆43 · Updated last year
- Holistic job manager on Kubernetes ☆116 · Updated last year
- AWS virtual GPU device plugin provides the capability to use smaller virtual GPUs for your machine learning inference workloads ☆205 · Updated last year
- NVIDIA k8s device plugin for KubeVirt ☆261 · Updated last week
- An efficient GPU resource sharing system with fine-grained control for Linux platforms. ☆84 · Updated last year
- Run Slurm on Kubernetes. A Slinky project. ☆151 · Updated last week
- Golang bindings for NVIDIA Data Center GPU Manager (DCGM) ☆128 · Updated this week