FZJ-JSC / jubench
JUPITER Benchmark Suite
☆13Updated 7 months ago
Alternatives and similar repositories for jubench:
Users that are interested in jubench are comparing it to the libraries listed below
- ☆17Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- A tracing infrastructure for heterogeneous computing applications.☆29Updated last week
- ☆10Updated this week
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆22Updated 6 years ago
- ☆42Updated 4 years ago
- ☆14Updated 4 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- ☆11Updated 2 weeks ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆76Updated 11 months ago
- Training examples for SYCL☆39Updated last month
- A Micro-benchmarking Tool for HPC Networks☆25Updated last month
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆46Updated 3 weeks ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 3 years ago
- Scripts for running various benchmarks on Isambard and other systems.☆28Updated 3 years ago
- Advanced Profiling and Analytics for AMD Hardware☆140Updated this week
- A task benchmark☆41Updated 6 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆56Updated last week
- SST Macro Element Library☆36Updated this week
- ☆17Updated 5 years ago
- Chai☆42Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated last year
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 2 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆46Updated this week
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆20Updated 10 months ago
- MPI accelerator-integrated communication extensions☆32Updated last year
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆22Updated last year
- NAS Parallel Benchmarks for evaluating GPU and APIs☆23Updated last week
- CPU and GPU tutorial examples☆13Updated this week
- Logger for MPI communication☆26Updated last year