NVIDIA / grace-kernel
Upstream Kernel with Grace upstream pending patches for partners. Patches include any bug fixes during Grace production while they await upstreaming.
☆15Updated last year
Alternatives and similar repositories for grace-kernel:
Users that are interested in grace-kernel are comparing it to the libraries listed below
- CUDA GDB☆199Updated last month
- ROC profiler library. Profiling with perf-counters and derived metrics.☆138Updated last week
- Bandwidth test for ROCm☆54Updated 2 weeks ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated last week
- Magnum IO community repo☆86Updated 2 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆67Updated this week
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆59Updated this week
- The compiler support repository provides various Lightning Compiler related services.☆45Updated 10 months ago
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆86Updated 5 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated last week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆40Updated last week
- ☆34Updated last week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- HPCG benchmark based on ROCm platform☆37Updated 2 weeks ago
- This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.☆54Updated this week
- RCCL Performance Benchmark Tests☆60Updated 2 weeks ago
- oneAPI Collective Communications Library (oneCCL)☆227Updated last week
- CMake modules used within the ROCm libraries☆65Updated this week
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Updated 2 years ago
- Unified Collective Communication Library☆239Updated last week
- Linux Cross-Memory Attach☆90Updated 6 months ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆87Updated 11 months ago
- oneCCL Bindings for Pytorch*☆91Updated 2 weeks ago
- ROCm SPARSE marshalling library☆67Updated this week
- ☆106Updated 3 weeks ago
- ROCm Device Libraries☆97Updated 10 months ago
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver☆37Updated last week
- AMD’s C++ library for accelerating tensor primitives☆39Updated this week
- MPI accelerator-integrated communication extensions☆32Updated last year
- ROCm BLAS marshalling library☆134Updated this week