NVIDIA / cloud-native-stackLinks
Run cloud native workloads on NVIDIA GPUs
☆180Updated last month
Alternatives and similar repositories for cloud-native-stack
Users that are interested in cloud-native-stack are comparing it to the libraries listed below
Sorting:
- MIG Partition Editor for NVIDIA GPUs☆201Updated last week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆525Updated last month
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆98Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆114Updated this week
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆113Updated this week
- NVIDIA Network Operator☆257Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆300Updated last year
- CloudAI Benchmark Framework☆66Updated this week
- NVIDIA NCCL Tests for Distributed Training☆97Updated this week
- ☆251Updated 2 weeks ago
- Share GPU between Pods in Kubernetes☆209Updated 2 years ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆374Updated this week
- Tools to deploy GPU clusters in the Cloud☆31Updated 2 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆235Updated this week
- A Slurm cluster for Kubernetes☆60Updated 10 months ago
- ☆24Updated last month
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times☆88Updated 3 years ago
- NVIDIA k8s device plugin for Kubevirt☆257Updated 3 weeks ago
- Singularity implementation of k8s operator for interacting with SLURM.☆117Updated 4 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆482Updated last month
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes☆133Updated this week
- ☆270Updated this week
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆159Updated 6 years ago
- ☆43Updated last year
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆330Updated this week
- ☆353Updated last year
- Container plugin for Slurm Workload Manager☆344Updated 7 months ago
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆83Updated last year
- ☆62Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆176Updated last week