NTNU-HPC-Lab / BAT

A GPU benchmark suite for autotuners

☆18

Alternatives and similar repositories for BAT

Users who are interested in BAT are comparing it to the repositories listed below.

KernelTuner / kernel_launcher
Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner
☆20 Updated last year
LLNL / FPChecker
A dynamic analysis tool to detect floating-point errors in HPC applications.
☆36 Updated last week
FZJ-JSC / jubench
JUPITER Benchmark Suite
☆18 Updated 11 months ago
LLNL / mpibind
Pragmatic, Productive, and Portable Affinity for HPC
☆41 Updated 2 months ago
argonne-lcf / THAPI
A tracing infrastructure for heterogeneous computing applications.
☆33 Updated last week
pnnl / COMET
☆40 Updated 2 weeks ago
NVIDIA / nvidia-hpcg
NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.
☆57 Updated 2 months ago
lanl / PPT
Performance Prediction Toolkit
☆52 Updated 7 months ago
hpdps-group / coccl
COCCL: Compression and precision co-aware collective communication library
☆24 Updated 4 months ago
ekondis / gpuroofperf-toolkit
A GPU performance prediction toolkit for CUDA programs
☆17 Updated 6 years ago
ytopt-team / ytopt
ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization
☆49 Updated 3 weeks ago
OpenMP-Validation-and-Verification / OpenMP_VV
OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…
☆59 Updated 2 weeks ago
at-aaims / OpenMxP
This is the open source version of HPL-MXP. The code performance has been verified on Frontier
☆17 Updated last week
HPCToolkit / hpctoolkit-tutorial-examples
CPU and GPU tutorial examples
☆13 Updated 3 months ago
bsc-pm / tampi
The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…
☆24 Updated last month
lanl / benchmarks
Benchmarks
☆17 Updated 2 months ago
cyanguwa / nersc-roofline
☆45 Updated 4 years ago
olcf / olcf-user-docs
Sources for the Oak Ridge Leadership Computing Facility User Documentation
☆66 Updated last week
ROCm / roc-stdpar
☆18 Updated last year
RIKEN-RCCS / hpl-ai
An HPL-AI implementation for Fugaku
☆21 Updated 4 years ago
c3sr / comm_scope
NUMA-aware multi-CPU multi-GPU data transfer benchmarks
☆23 Updated last year
ParaStation / psmpi
☆17 Updated this week
LLNL / HPAC
☆10 Updated 3 months ago
ROCm / rocprofiler-compute
Advanced Profiling and Analytics for AMD Hardware
☆159 Updated this week
ORNL / iris
A unified framework across multiple programming platforms
☆41 Updated last month
RRZE-HPC / kerncraft
Loop Kernel Analysis and Performance Modeling Toolkit
☆94 Updated 3 months ago
mlcommons / hpc
Reference implementations of MLPerf™ HPC training benchmarks
☆48 Updated 4 months ago
khaki3 / ptxas-wrapper
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆15 Updated 2 years ago
LLNL / hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
☆34 Updated last week
codeplaysoftware / cutlass-sycl
A CUTLASS implementation using SYCL
☆30 Updated last week