GeorgOfenbeck / perfplot
tools to create performance and roofline plots from measured data
☆58Updated 10 years ago
Related projects ⓘ
Alternatives and complementary repositories for perfplot
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 5 years ago
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated this week
- Loop Kernel Analysis and Performance Modeling Toolkit☆89Updated 2 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆72Updated 8 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- ☆41Updated 4 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆31Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 2 months ago
- GPUDirect Async support for IB Verbs☆90Updated 2 years ago
- Chai☆42Updated 11 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated this week
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆132Updated this week
- The ultimate memory bandwidth benchmark☆46Updated last year
- RAJA Performance Suite☆110Updated last week
- A Benchmark Suite for Heterogeneous System Computation☆52Updated 3 weeks ago
- ☆58Updated last month
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆24Updated 4 years ago
- Logger for MPI communication☆26Updated last year
- Advanced Profiling and Analytics for AMD Hardware☆135Updated this week
- ☆47Updated 5 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆20Updated 6 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated last month
- Stencil Probe - a stencil microbenchmark☆29Updated 11 years ago
- ☆17Updated 2 years ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆44Updated 5 years ago
- Measure instruction latency and throughput☆22Updated 2 years ago
- ☆34Updated 2 years ago