parallelcodefoundry / ParEval
A Parallel Code Evaluation Benchmark
☆36 · Updated 3 months ago
Alternatives and similar repositories for ParEval
Users who are interested in ParEval are comparing it to the libraries listed below.
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆117 · Updated this week
- ☆13 · Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆162 · Updated this week
- Reference implementations of MLPerf™ HPC training benchmarks ☆49 · Updated 7 months ago
- Multi-GPU communication profiler and visualizer ☆34 · Updated last year
- A hierarchical collective communications library with portable optimizations ☆36 · Updated 9 months ago
- RCCL Performance Benchmark Tests ☆76 · Updated last week
- GPU Performance Advisor ☆66 · Updated 3 years ago
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization ☆49 · Updated 3 weeks ago
- ☆16 · Updated 5 months ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆301 · Updated 3 weeks ago
- ☆18 · Updated 5 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆32 · Updated 4 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆21 · Updated last year
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆34 · Updated last month
- JUPITER Benchmark Suite ☆20 · Updated 2 months ago
- This repo contains the dataset for the paper: Creating a Dataset Supporting Translation Between OpenMP Fortran and C++ Code ☆15 · Updated last year
- Benchmark for measuring the performance of sparse and irregular memory access. ☆79 · Updated last month
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆110 · Updated 2 years ago
- A light-weight MPI profiler. ☆97 · Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆152 · Updated this week
- The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures ☆59 · Updated 3 weeks ago
- ☆10 · Updated 6 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆31 · Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆62 · Updated last month
- ☆33 · Updated last year
- ☆263 · Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆84 · Updated last week
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal. ☆16 · Updated 3 years ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆71 · Updated last month