NVIDIA / apt-packaging-cuda-keyring
CUDA keyring packaging for Debian
☆13Updated 2 years ago
Alternatives and similar repositories for apt-packaging-cuda-keyring
Users that are interested in apt-packaging-cuda-keyring are comparing it to the libraries listed below
Sorting:
- AMD SMI☆65Updated this week
- ☆58Updated 10 months ago
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated last week
- AMD related optimizations for transformer models☆75Updated 6 months ago
- Bandwidth test for ROCm☆55Updated last week
- python package of rocm-smi-lib☆20Updated 7 months ago
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆95Updated this week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆17Updated last week
- ☆18Updated last week
- CMake modules used within the ROCm libraries☆66Updated last week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated 2 weeks ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆46Updated 2 months ago
- ☆19Updated this week
- AMD’s C++ library for accelerating tensor primitives☆40Updated last week
- Inference server benchmarking tool☆59Updated 2 weeks ago
- ROCm BLAS marshalling library☆141Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.☆27Updated this week
- ☆69Updated last month
- Port of Facebook's LLaMA model in C/C++☆22Updated 8 months ago
- Gpu benchmark☆61Updated 3 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆68Updated this week
- Distributed preprocessing and data loading for language datasets☆39Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆138Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- Benchmarks to capture important workloads.☆31Updated 3 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 4 months ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆115Updated 5 months ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated this week
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 2 months ago