GeorgOfenbeck / perfplot
tools to create performance and roofline plots from measured data
☆58Updated 10 years ago
Alternatives and similar repositories for perfplot:
Users that are interested in perfplot are comparing it to the libraries listed below
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆92Updated this week
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 3 years ago
- ☆43Updated 4 years ago
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆139Updated this week
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆64Updated 6 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 5 months ago
- Chai☆43Updated last year
- ☆59Updated 5 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆78Updated last year
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆65Updated last week
- The ultimate memory bandwidth benchmark☆47Updated last month
- ☆52Updated 5 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆77Updated last month
- ☆34Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆53Updated 3 weeks ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 3 months ago
- The Splash-3 benchmark suite☆43Updated last year
- TLB Benchmarks☆33Updated 7 years ago
- A tracing infrastructure for heterogeneous computing applications.☆29Updated this week
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆22Updated 6 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆61Updated this week
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆44Updated 5 years ago
- Automatically exported from code.google.com/p/patus☆15Updated 9 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 7 years ago
- Measure instruction latency and throughput☆23Updated 2 weeks ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆24Updated 5 years ago
- RAJA Performance Suite☆119Updated this week