NVIDIA / nvidia-container-toolkitLinks
Build and run containers leveraging NVIDIA GPUs
☆3,282Updated this week
Alternatives and similar repositories for nvidia-container-toolkit
Users that are interested in nvidia-container-toolkit are comparing it to the libraries listed below
Sorting:
- NVIDIA container runtime library☆961Updated last week
- NVIDIA container runtime☆1,116Updated last year
- An Open Source Machine Learning Framework for Everyone☆1,144Updated 8 months ago
- NVIDIA device plugin for Kubernetes☆3,238Updated this week
- Simple, safe way to store and distribute tensors☆3,282Updated this week
- AIStore: scalable storage for AI applications☆1,519Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,145Updated this week
- Build and run Docker containers leveraging NVIDIA GPUs☆17,382Updated last year
- Multi-GPU CUDA stress test☆1,714Updated 9 months ago
- Docker CLI plugin for extended build capabilities with BuildKit☆3,937Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆9,548Updated last week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,209Updated last week
- Development repository for the Triton language and compiler☆15,735Updated this week
- Tools for building GPU clusters☆1,360Updated last month
- CUDA Python: Performance meets Productivity☆2,719Updated this week
- Accessible large language models via k-bit quantization for PyTorch.☆7,104Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆10,629Updated this week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆520Updated last month
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,298Updated this week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆5,596Updated last week
- Fast and memory-efficient exact attention☆17,664Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆2,929Updated last week
- Ongoing research training transformer models at scale☆12,468Updated last week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,450Updated last week
- Large Language Model Text Generation Inference☆10,172Updated last week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆11,675Updated 2 weeks ago
- Optimized primitives for collective multi-GPU communication☆3,761Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆14,814Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,866Updated this week
- Official inference library for Mistral models☆10,275Updated 2 months ago