NVIDIA / nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
☆2,472Updated this week
Related projects ⓘ
Alternatives and complementary repositories for nvidia-container-toolkit
- NVIDIA container runtime library☆846Updated this week
- NVIDIA container runtime☆1,108Updated last year
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆924Updated this week
- NVIDIA device plugin for Kubernetes☆2,835Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆8,681Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆1,854Updated this week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆4,828Updated 3 weeks ago
- AIStore: scalable storage for AI applications☆1,290Updated this week
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm☆8,235Updated 2 weeks ago
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T☆2,341Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆10,820Updated 2 weeks ago
- Multi-GPU CUDA stress test☆1,435Updated 3 months ago
- Build and run Docker containers leveraging NVIDIA GPUs☆17,259Updated 11 months ago
- Nvidia GPU exporter for prometheus using nvidia-smi binary☆898Updated this week
- An Open Source Machine Learning Framework for Everyone☆997Updated last month
- Fast and memory-efficient exact attention☆14,279Updated this week
- Tools for monitoring NVIDIA GPUs on Linux☆1,018Updated 3 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,624Updated this week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆415Updated this week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆742Updated this week
- Tools for building GPU clusters☆1,265Updated 8 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆3,680Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆8,660Updated last week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,415Updated 10 months ago
- OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference☆7,304Updated this week
- Apptainer: Application containers for Linux☆1,134Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆6,127Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,597Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆570Updated this week
- Large Language Model Text Generation Inference☆9,122Updated this week