NVIDIA / nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
☆2,687 Updated this week
Alternatives and similar repositories for nvidia-container-toolkit:
Users who are interested in nvidia-container-toolkit are comparing it to the repositories listed below.
- NVIDIA container runtime library ☆881 Updated last month
- NVIDIA container runtime ☆1,112 Updated last year
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain… ☆9,147 Updated this week
- Multi-GPU CUDA stress test ☆1,514 Updated 4 months ago
- NVIDIA device plugin for Kubernetes ☆2,961 Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆2,667 Updated this week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management. ☆5,017 Updated this week
- Build and run Docker containers leveraging NVIDIA GPUs ☆17,300 Updated last year
- Simple, safe way to store and distribute tensors ☆3,010 Updated last week
- AIStore: scalable storage for AI applications ☆1,355 Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆1,694 Updated this week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit ☆6,756 Updated 5 months ago
- A blazing fast inference solution for text embeddings models ☆3,043 Updated last week
- DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for comm… ☆2,313 Updated last month
- Accessible large language models via k-bit quantization for PyTorch. ☆6,522 Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs… ☆2,086 Updated this week
- Tools for building GPU clusters ☆1,279 Updated 10 months ago
- CUDA Python: Performance meets Productivity ☆1,045 Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs. ☆1,707 Updated this week
- Manage GPU clusters for running AI models ☆1,048 Updated this week
- Transformer related optimization, including BERT, GPT ☆5,981 Updated 9 months ago
- An Open Source Machine Learning Framework for Everyone ☆1,050 Updated 3 months ago
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs ☆440 Updated last week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments. ☆763 Updated last month
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆8,597 Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆8,910 Updated this week
- Optimized primitives for collective multi-GPU communication ☆3,375 Updated last week
- Intel® NPU Acceleration Library ☆578 Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,647 Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆11,056 Updated last month