icl-utk-edu / papi
☆139 · Updated last week
Alternatives and similar repositories for papi:
Users who are interested in papi are comparing it to the libraries listed below.
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPUs ☆87 · Updated 11 months ago
- Advanced Profiling and Analytics for AMD Hardware ☆143 · Updated this week
- The University of Bristol HPC Simulation Engine ☆96 · Updated 3 weeks ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆77 · Updated last month
- Unified Collective Communication Library ☆239 · Updated last week
- A tracing infrastructure for heterogeneous computing applications. ☆31 · Updated this week
- SYCL Benchmark Suite ☆64 · Updated last month
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite ☆64 · Updated 6 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆132 · Updated this week
- A light-weight MPI profiler. ☆90 · Updated 8 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆67 · Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆138 · Updated last week
- ☆15 · Updated 6 months ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆32 · Updated 4 years ago
- A GPU accelerated error-bounded lossy compression for scientific data. ☆73 · Updated 2 weeks ago
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver ☆37 · Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆64 · Updated this week
- An easy-to-use automatic performance diagnosis and optimization tool for HPC applications ☆34 · Updated 7 years ago
- ☆43 · Updated 4 years ago
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/ ☆19 · Updated 5 years ago
- Measure instruction latency and throughput ☆23 · Updated last month
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆49 · Updated this week
- The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures ☆51 · Updated 2 months ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services ☆140 · Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more ☆132 · Updated this week
- ☆236 · Updated last month
- ☆233 · Updated this week
- Omnitrace: Application Profiling, Tracing, and Analysis ☆309 · Updated 2 weeks ago
- RAJA Performance Suite ☆118 · Updated this week
- TPP experimentation on MLIR for linear algebra ☆121 · Updated 2 weeks ago