NVIDIA / nvidia-container-runtime
NVIDIA container runtime
☆1,107Updated last year
Related projects ⓘ
Alternatives and complementary repositories for nvidia-container-runtime
- NVIDIA container runtime library☆838Updated this week
- Tools for monitoring NVIDIA GPUs on Linux☆1,017Updated 3 years ago
- NVIDIA device plugin for Kubernetes☆2,816Updated this week
- Tools for building GPU clusters☆1,262Updated 8 months ago
- Build and run containers leveraging NVIDIA GPUs☆2,445Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆910Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,409Updated 10 months ago
- GPU plugin to the node feature discovery for Kubernetes☆291Updated 5 months ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆470Updated last year
- A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.☆640Updated 2 weeks ago
- Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster☆272Updated 2 weeks ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆410Updated 2 months ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆440Updated 3 weeks ago
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆1,833Updated this week
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times☆88Updated 2 years ago
- Distributed ML Training and Fine-Tuning on Kubernetes☆1,605Updated this week
- PyTorch on Kubernetes☆306Updated 2 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆214Updated this week
- NVIDIA GPU Prometheus Exporter☆224Updated 3 years ago
- Optimized primitives for collective multi-GPU communication☆3,231Updated last month
- Multi-GPU CUDA stress test☆1,420Updated 2 months ago
- NVIDIA k8s device plugin for Kubevirt☆230Updated 3 weeks ago
- A repository for Kustomize manifests☆818Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆431Updated 2 months ago
- ☆829Updated 7 months ago
- A CLI for Kubeflow.☆737Updated this week
- Go Bindings for the NVIDIA Management Library (NVML)☆312Updated 3 weeks ago
- ☆115Updated 3 months ago
- Run cloud native workloads on NVIDIA GPUs☆133Updated last month
- This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kube…☆813Updated 2 years ago