Build and run containers leveraging NVIDIA GPUs
☆4,149Mar 17, 2026Updated this week
Alternatives and similar repositories for nvidia-container-toolkit
Users that are interested in nvidia-container-toolkit are comparing it to the libraries listed below
Sorting:
- NVIDIA container runtime library☆1,078Mar 12, 2026Updated last week
- Build and run Docker containers leveraging NVIDIA GPUs☆17,511Dec 6, 2023Updated 2 years ago
- NVIDIA device plugin for Kubernetes☆3,699Mar 13, 2026Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,590Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,648Feb 25, 2026Updated 3 weeks ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆162Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆73,479Updated this week
- NVIDIA container runtime☆1,123Oct 27, 2023Updated 2 years ago
- GPU plugin to the node feature discovery for Kubernetes☆307May 27, 2024Updated last year
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆685Feb 17, 2026Updated last month
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆13,120Updated this week
- ☆289Mar 9, 2026Updated last week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆8,953Jan 6, 2026Updated 2 months ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,800Mar 9, 2026Updated last week
- AIStore: scalable storage for AI applications☆1,779Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,446Updated this week
- NVIDIA Linux open GPU kernel module source☆16,815Mar 13, 2026Updated last week
- Heterogeneous GPU Sharing on Kubernetes☆3,110Updated this week
- An open and reliable container runtime☆20,495Mar 13, 2026Updated last week
- ☆338Mar 11, 2026Updated last week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆165,557Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,381Mar 11, 2026Updated last week
- LLM inference in C/C++☆98,098Updated this week
- Go Bindings for the NVIDIA Management Library (NVML)☆426Feb 12, 2026Updated last month
- SGLang is a high-performance serving framework for large language models and multimodal models.☆24,829Updated this week
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆50Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆16,918Updated this week
- Optimized primitives for collective multi-GPU communication☆4,531Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,371Mar 14, 2026Updated last week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,568Updated this week
- contaiNERD CTL - Docker-compatible CLI for containerd, with support for Compose, Rootless, eStargz, OCIcrypt, IPFS, ...☆9,911Updated this week
- NVIDIA DRA Driver for GPUs☆585Updated this week
- Podman: A tool for managing OCI containers and pods.☆31,063Updated this week
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes☆157Mar 12, 2026Updated last week
- Ongoing research training transformer models at scale☆15,744Updated this week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter…☆14,745Aug 12, 2024Updated last year
- A collection of useful Go libraries for use with NVIDIA GPU management tools☆50Jan 15, 2026Updated 2 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,869Updated this week
- Fast and memory-efficient exact attention☆22,832Updated this week