GMAP / NPB-GPU
NAS Parallel Benchmarks for evaluating GPUs and their programming APIs
☆21 Updated 5 months ago
Alternatives and similar repositories for NPB-GPU:
Users who are interested in NPB-GPU are comparing it to the repositories listed below.
- The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures ☆50 Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆48 Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆45 Updated last week
- Logger for MPI communication ☆26 Updated last year
- JUPITER Benchmark Suite ☆12 Updated 5 months ago
- OpenMP vs Offload ☆21 Updated last year
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP. ☆21 Updated 3 years ago
- Instantiate the Cache-Aware Roofline Model on single-socket and multi-socket systems. ☆27 Updated 5 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark ☆75 Updated 10 months ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆45 Updated 8 months ago
- A Micro-benchmarking Tool for HPC Networks ☆24 Updated 2 weeks ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆63 Updated last week
- A tracing infrastructure for heterogeneous computing applications. ☆28 Updated 2 weeks ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆30 Updated 3 months ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆31 Updated 3 years ago
- Comb is a communication performance benchmarking tool. ☆24 Updated last year
- HPCG benchmark based on ROCm platform ☆35 Updated 2 weeks ago
- Scripts for running various benchmarks on Isambard and other systems. ☆28 Updated 3 years ago
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆39 Updated this week
- MPI accelerator-integrated communication extensions ☆32 Updated last year
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆21 Updated 11 months ago
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u… ☆44 Updated 5 years ago
- Official BOLT Repository ☆28 Updated 5 months ago
- Slides and exercises for persistent memory programming tutorial ☆12 Updated 2 years ago