NVIDIA / grace-kernel
Upstream kernel carrying Grace patches that are pending upstream, for partners. Patches include any bug fixes made during Grace production while they await upstreaming.
☆16 Updated 2 years ago
Alternatives and similar repositories for grace-kernel
Users who are interested in grace-kernel are comparing it to the repositories listed below.
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆154 Updated 2 weeks ago
- Magnum IO community repo ☆109 Updated 2 months ago
- This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger. ☆66 Updated last week
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPUs ☆92 Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks ☆49 Updated 11 months ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs) ☆56 Updated last week
- oneAPI Collective Communications Library (oneCCL) ☆254 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆148 Updated last week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆35 Updated 4 months ago
- Simple message passing library ☆30 Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆144 Updated this week
- Unified Collective Communication Library ☆290 Updated last week
- Bandwidth test for ROCm ☆75 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆86 Updated 2 weeks ago
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPUs by running a BLAS matrix multiply using different data types… ☆119 Updated 6 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆95 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆84 Updated 2 weeks ago
- CUDA GDB ☆230 Updated last month
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆442 Updated 2 weeks ago
- ☆384 Updated last year
- NVIDIA GPUDirect Storage Driver ☆331 Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆130 Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆124 Updated last week
- AMD SMI ☆115 Updated 2 weeks ago
- GPUDirect Async support for IB Verbs ☆135 Updated 3 years ago
- oneAPI Level Zero Conformance & Performance test content ☆60 Updated this week
- AMD’s C++ library for accelerating tensor primitives ☆49 Updated last week
- super repo for rocm systems projects ☆230 Updated last week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆65 Updated last month