ROCm / rocprofiler-computeLinks
Advanced Profiling and Analytics for AMD Hardware
☆159Updated this week
Alternatives and similar repositories for rocprofiler-compute
Users that are interested in rocprofiler-compute are comparing it to the libraries listed below
Sorting:
- ROC profiler library. Profiling with perf-counters and derived metrics.☆149Updated 3 weeks ago
- Next generation SPARSE implementation for ROCm platform☆129Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆172Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- ☆28Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆91Updated last week
- Next generation LAPACK implementation for ROCm platform☆105Updated this week
- Bandwidth test for ROCm☆59Updated 2 weeks ago
- SYCL Open Source Specification☆136Updated last week
- STREAM, for lots of devices written in many programming models☆344Updated 10 months ago
- SYCL Benchmark Suite☆65Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆84Updated 2 weeks ago
- rocWMMA☆118Updated this week
- ROCm BLAS marshalling library☆144Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆106Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated 2 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆58Updated 3 weeks ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆84Updated 3 weeks ago
- ☆247Updated last month
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆229Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆121Updated this week
- AMD lab notes with code examples to demonstrate use of AMD GPUs☆97Updated last year
- ROCm SPARSE marshalling library☆67Updated last week
- HPCG benchmark based on ROCm platform☆37Updated last week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆225Updated this week
- AMD’s C++ library for accelerating tensor primitives☆43Updated last week
- ☆34Updated last year
- ☆148Updated last week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆42Updated this week
- ☆37Updated last week