NVIDIA/cloud-native-stack

NVIDIA / cloud-native-stack

Run cloud native workloads on NVIDIA GPUs

☆239

Alternatives and similar repositories for cloud-native-stack

Users that are interested in cloud-native-stack are comparing it to the libraries listed below.

NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆159 · Updated this week
NVIDIA / k8s-operator-libs
A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.
☆30 · Updated this week
kubernetes-sigs / dra-driver-nvidia-gpu
DRA Driver for NVIDIA GPUs
☆674 · Updated this week
NVIDIA / gpu-feature-discovery
GPU plugin to the node feature discovery for Kubernetes
☆309 · May 27, 2024 · Updated 2 years ago
NVIDIA / gpu-operator
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
☆2,795 · Updated this week
NVIDIA / ngc-container-replicator
NGC Container Replicator
☆29 · Dec 26, 2022 · Updated 3 years ago
NVIDIA / NVIDIA_AI_Enterprise_AzureML
Source Code and Usage Samples for the Resources hosted in the NVIDIA AI Enterprise AzureML Registry
☆21 · Aug 7, 2024 · Updated last year
jaredhocutt / openshift-provision
Provision infrastructure and install OpenShift 3.
☆25 · Sep 23, 2021 · Updated 4 years ago
Mellanox / nic-configuration-operator
NVIDIA Networking NIC Configuration Operator For Kubernetes
☆22 · Updated this week
kai-scheduler / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆1,401 · Updated this week
NVIDIA / gpu-driver-container
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
☆179 · Updated this week
NVIDIA / deepops
Tools for building GPU clusters
☆1,462 · Updated this week
run-ai / fake-gpu-operator
☆295 · Jul 5, 2026 · Updated 2 weeks ago
Mellanox / network-operator
NVIDIA Network Operator
☆356 · Updated this week
Mellanox / rshim-user-space
Linux based user-space RSHIM driver for the Mellanox BlueField SoC
☆35 · Jul 9, 2026 · Updated last week
NVIDIA / mig-parted
MIG Partition Editor for NVIDIA GPUs
☆260 · Updated this week
openshift / sriov-network-operator
SR-IOV Network Operator
☆152 · Updated this week
NVIDIA / ais-k8s
Kubernetes Operator, Helm Charts, Ansible Playbooks, and utility scripts for large-scale AIStore deployments on Kubernetes.
☆132 · Updated this week
NVIDIA / dgx-selinux
DGX RHEL SELinux Policies
☆19 · Apr 12, 2024 · Updated 2 years ago
NVIDIA / nim-deploy
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…
☆240 · May 15, 2026 · Updated 2 months ago
NVIDIA / gpu-usage-monitor
A comprehensive Helm chart for monitoring GPU resources in Kubernetes clusters. This tool provides real-time visibility into GPU allocati…
☆28 · Jun 30, 2026 · Updated 2 weeks ago
Mellanox / ib-kubernetes
☆78 · Updated this week
kubernetes-sigs / dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
☆136 · Updated this week
NVIDIA / nv-cloud-function-helpers
Functions that simplify common tasks with NVIDIA Cloud Functions
☆20 · May 5, 2026 · Updated 2 months ago
RenaudWasTaken / cdi
Container Device Interface - Devices for Linux containers
☆10 · Jul 25, 2020 · Updated 5 years ago
NVIDIA / enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
☆978 · Jun 9, 2026 · Updated last month
NVIDIA / k8s-kata-manager
☆22 · May 26, 2026 · Updated last month
cncf-tags / container-device-interface
☆305 · Updated this week
rhpds / poolboy
Operator for managing resource claims and provisioning
☆11 · Updated this week
NVIDIA / libnvidia-container
NVIDIA container runtime library
☆1,117 · Updated this week
llm-d-incubation / llm-d-modelservice
Helm charts for deploying models with llm-d
☆31 · Jun 27, 2026 · Updated 3 weeks ago
NVIDIA / k8s-driver-manager
The NVIDIA Driver Manager is a Kubernetes component which assists in seamless upgrades of the NVIDIA driver on each node of the cluster.
☆55 · Updated this week
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆79 · Jul 6, 2026 · Updated 2 weeks ago
Mellanox / nvidia-k8s-ipam
IPAM plugin for Kubernetes
☆35 · Updated this week
NVIDIA / k8s-device-plugin
NVIDIA device plugin for Kubernetes
☆3,820 · Updated this week
dswarbrick / fabricmon
InfiniBand fabric monitoring daemon written in Go
☆32 · May 22, 2025 · Updated last year
NVIDIA / go-gpuallocator
Go Abstraction for Allocating NVIDIA GPUs with Custom Policies
☆123 · Updated this week
redhat-nfvpe / cni-route-override
CNI plugin to override routes for a container interface
☆41 · May 27, 2026 · Updated last month
NVIDIA / nvidia-terraform-modules
Infrastructure as code for GPU accelerated managed Kubernetes clusters.
☆60 · Apr 30, 2025 · Updated last year