NVIDIA/ais-k8s

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/ais-k8s)

NVIDIA / ais-k8s

Kubernetes Operator, Helm Charts, Ansible Playbooks, and utility scripts for large-scale AIStore deployments on Kubernetes.

☆132

Alternatives and similar repositories for ais-k8s

Users that are interested in ais-k8s are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVIDIA / ais-etl
View on GitHub
Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on t…
☆21Updated this week
kubernetes-sigs / dra-driver-nvidia-gpu
View on GitHub
DRA Driver for NVIDIA GPUs
☆677Updated this week
NVIDIA / PixelView
View on GitHub
A compact and extensible image viewer
☆12Jun 22, 2020Updated 6 years ago
NVIDIA / k8s-operator-libs
View on GitHub
A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.
☆30Updated this week
NVIDIA / NVIDIA_AI_Enterprise_AzureML
View on GitHub
Source Code and Usage Samples for the Resources hosted in the NVIDIA AI Enterprise AzureML Registry
☆21Aug 7, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA / apt-packaging-fabric-manager
View on GitHub
Fabric Manager packaging for Debian
☆19Jun 25, 2021Updated 5 years ago
NVIDIA / fleet-command
View on GitHub
NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…
☆16Jul 20, 2022Updated 4 years ago
NVIDIA / topograph
View on GitHub
A toolkit for discovering cluster network topology.
☆148Updated this week
NVIDIA / go-nvlib
View on GitHub
A collection of useful Go libraries for use with NVIDIA GPU management tools
☆58Jul 16, 2026Updated last week
NVIDIA / gpu-operator
View on GitHub
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
☆2,807Updated this week
ralgozino / oh-my-kustomize
View on GitHub
Oh My ZSH Kustomize Plugin
☆15May 28, 2025Updated last year
NVIDIA / vgpu-device-manager
View on GitHub
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
☆161Updated this week
SlinkyProject / slurm-bridge
View on GitHub
Run Slurm as a Kubernetes scheduler. A Slinky project.
☆91Updated this week
kai-scheduler / KAI-Scheduler
View on GitHub
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆1,409Updated this week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Mellanox / network-operator
View on GitHub
NVIDIA Network Operator
☆357Updated this week
MikeZappa87 / dra-example
View on GitHub
☆17Jan 30, 2026Updated 5 months ago
NVIDIA / k8s-nim-operator
View on GitHub
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆159Updated this week
NVIDIA / mig-parted
View on GitHub
MIG Partition Editor for NVIDIA GPUs
☆259Updated this week
NVIDIA / knavigator
View on GitHub
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆79Jul 6, 2026Updated 2 weeks ago
NVIDIA / multi-storage-client
View on GitHub
Unified high-performance Python client for object and file stores.
☆79Updated this week
NVIDIA / cloud-native-stack
View on GitHub
Run cloud native workloads on NVIDIA GPUs
☆240Jul 14, 2026Updated last week
Mellanox / ipoib-cni
View on GitHub
IP Over Infiniband (IPoIB) CNI Plugin
☆18Updated this week
sighupio / module-aws
View on GitHub
AWS Module: additional components for EKS-based clusters on AWS
☆14Jul 7, 2026Updated 2 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Iceber / wasmcloud-ollama
View on GitHub
☆19Feb 7, 2024Updated 2 years ago
kubernetes-sigs / jobset
View on GitHub
JobSet: a k8s native API for distributed ML training and HPC workloads
☆334Updated this week
containers / netavark-dhcp-proxy-deprecated
View on GitHub
DHCP proxy for Netavark
☆11Jun 12, 2023Updated 3 years ago
NVIDIA / gpu-usage-monitor
View on GitHub
A comprehensive Helm chart for monitoring GPU resources in Kubernetes clusters. This tool provides real-time visibility into GPU allocati…
☆31Jun 30, 2026Updated 3 weeks ago
NVIDIA / workbench-example-multimodal-virtual-assistant
View on GitHub
An NVIDIA AI Workbench example project to build a multimodal virtual assistant
☆22Apr 17, 2025Updated last year
telegramdesktop / openal-soft
View on GitHub
OpenAL Soft is a software implementation of the OpenAL 3D audio API.
☆18May 15, 2025Updated last year
NVIDIA / k8s-device-plugin
View on GitHub
NVIDIA device plugin for Kubernetes
☆3,827Updated this week
NVIDIA / nvflow
View on GitHub
Workflow orchestration framework for end-to-end synthetic data generation (SDG), training (SFT), and evaluation pipelines built on NVIDIA…
☆18Jun 26, 2026Updated 3 weeks ago
NVIDIA / gpu-driver-container
View on GitHub
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
☆179Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
fsspec / alluxiofs
View on GitHub
Speed up fsspec data access with Alluxio distributed caching.
☆18Mar 22, 2026Updated 4 months ago
NVlabs / dlinputs
View on GitHub
Input pipelines for large scale, sharded training of deep learning models.
☆40Jun 18, 2019Updated 7 years ago
kubeflow / mpi-operator
View on GitHub
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
☆530Updated this week
NVIDIA / dcgm-exporter
View on GitHub
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
☆1,811Updated this week
Mellanox / nic-configuration-operator
View on GitHub
NVIDIA Networking NIC Configuration Operator For Kubernetes
☆22Updated this week
NVIDIA / kubevirt-gpu-device-plugin
View on GitHub
NVIDIA k8s device plugin for Kubevirt
☆288Jul 15, 2026Updated last week
kubernetes-sigs / kueue
View on GitHub
Kubernetes-native Job Queueing
☆2,747Updated this week