NVIDIA / nvidia-container-toolkitLinks
Build and run containers leveraging NVIDIA GPUs
☆3,515Updated this week
Alternatives and similar repositories for nvidia-container-toolkit
Users that are interested in nvidia-container-toolkit are comparing it to the libraries listed below
Sorting:
- NVIDIA container runtime library☆996Updated last week
- NVIDIA device plugin for Kubernetes☆3,364Updated last week
- NVIDIA container runtime☆1,122Updated last year
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,232Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,327Updated this week
- An Open Source Machine Learning Framework for Everyone☆1,152Updated last week
- Simple, safe way to store and distribute tensors☆3,380Updated this week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆5,839Updated this week
- Multi-GPU CUDA stress test☆1,831Updated 11 months ago
- Optimized primitives for collective multi-GPU communication☆3,923Updated 2 weeks ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,930Updated this week
- CUDA Python: Performance meets Productivity☆2,881Updated this week
- AIStore: scalable storage for AI applications☆1,569Updated this week
- Development repository for the Triton language and compiler☆16,484Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,627Updated this week
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm☆9,426Updated last month
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆558Updated 3 months ago
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆7,876Updated 2 months ago
- Ongoing research training transformer models at scale☆13,130Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆2,049Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆11,250Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,016Updated this week
- Accessible large language models via k-bit quantization for PyTorch.☆7,450Updated this week
- Docker CLI plugin for extended build capabilities with BuildKit☆4,025Updated last week
- ⚠️DirectML is in maintenance mode ⚠️ DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. Direct…☆2,499Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,413Updated this week
- Transformer related optimization, including BERT, GPT☆6,267Updated last year
- Nvidia GPU exporter for prometheus using nvidia-smi binary☆1,218Updated this week
- This repository contains tutorials and examples for Triton Inference Server☆751Updated last week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,614Updated this week