ROCm / rocprofiler-compute
Advanced Profiling and Analytics for AMD Hardware
☆135Updated this week
Related projects ⓘ
Alternatives and complementary repositories for rocprofiler-compute
- ROCm Parallel Primitives☆161Updated this week
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆128Updated this week
- Next generation SPARSE implementation for ROCm platform☆116Updated this week
- ☆14Updated this week
- Reusable software components for ROCm developers☆78Updated this week
- Next generation LAPACK implementation for ROCm platform☆93Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆65Updated 10 months ago
- SYCL Benchmark Suite☆56Updated 2 months ago
- ☆42Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated 3 weeks ago
- ROCm SPARSE marshalling library☆69Updated this week
- SYCL Open Source Specification☆114Updated this week
- RAJA Performance Suite☆110Updated last week
- oneAPI Level Zero Conformance & Performance test content☆46Updated this week
- ☆17Updated 9 months ago
- MPI accelerator-integrated communication extensions☆32Updated last year
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆23Updated this week
- ☆128Updated this week
- AMD lab notes with code examples to demonstrate use of AMD GPUs☆91Updated 4 months ago
- Bandwidth test for ROCm☆47Updated this week
- rocWMMA☆91Updated this week
- RAND library for HIP programming language☆110Updated this week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆122Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆44Updated 3 weeks ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆206Updated this week
- RCCL Performance Benchmark Tests☆49Updated 2 weeks ago
- ☆58Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆201Updated 2 weeks ago