icl-utk-edu / papi
☆144Updated 2 weeks ago
Alternatives and similar repositories for papi:
Users that are interested in papi are comparing it to the libraries listed below
- Advanced Profiling and Analytics for AMD Hardware☆147Updated this week
- ☆241Updated 2 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆141Updated this week
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated last week
- SYCL Benchmark Suite☆64Updated last month
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆134Updated 3 weeks ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆52Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆107Updated this week
- ☆237Updated this week
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆59Updated 5 months ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆89Updated last year
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆76Updated this week
- Unified Collective Communication Library☆246Updated this week
- ☆46Updated this week
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆68Updated this week
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆151Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆134Updated this week
- The University of Bristol HPC Simulation Engine☆96Updated last month
- SYCL Open Source Specification☆134Updated last week
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆142Updated this week
- GPUDirect Async support for IB Verbs☆110Updated 2 years ago
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.☆21Updated 3 years ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆81Updated last week
- Omnitrace: Application Profiling, Tracing, and Analysis☆311Updated 2 weeks ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆42Updated last year
- Loop Kernel Analysis and Performance Modeling Toolkit☆93Updated last month
- The Splash-3 benchmark suite☆44Updated last year
- The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures☆51Updated last week