GMAP / NPB-CPP
The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures
☆51Updated last week
Alternatives and similar repositories for NPB-CPP:
Users that are interested in NPB-CPP are comparing it to the libraries listed below
- NAS Parallel Benchmarks for evaluating GPU and APIs☆24Updated 2 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆78Updated last year
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.☆21Updated 3 years ago
- ☆16Updated 7 months ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆31Updated last year
- ☆34Updated 3 years ago
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆42Updated last year
- Logger for MPI communication☆26Updated last year
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Chai☆43Updated last year
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆76Updated this week
- A Micro-benchmarking Tool for HPC Networks☆27Updated 3 months ago
- Light-weight Performance Variance Detection for Production-run Parallel Applications☆13Updated last year
- ☆25Updated 4 years ago
- The Splash-3 benchmark suite☆44Updated last year
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆33Updated 2 years ago
- A tracing infrastructure for heterogeneous computing applications.☆31Updated this week
- Measure instruction latency and throughput☆24Updated last month
- An unofficial mirror of the core PARSEC 3.0 benchmark suite with patches to run on x86_64 Arch Linux and generalize builds.☆105Updated 2 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)☆25Updated last year
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆68Updated this week
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 7 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆52Updated last week
- LonestarGPU: Irregular algorithms parallelized for GPUs☆34Updated 5 years ago
- Performance Prediction Toolkit☆51Updated 4 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆64Updated 6 years ago
- ☆59Updated 6 months ago
- ☆21Updated 2 years ago