NTNU-HPC-Lab / BAT
A GPU benchmark suite for autotuners
☆18 Updated last year
Alternatives and similar repositories for BAT
Users who are interested in BAT are comparing it to the repositories listed below.
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner ☆20 Updated last year
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆36 Updated last week
- JUPITER Benchmark Suite ☆18 Updated 11 months ago
- Pragmatic, Productive, and Portable Affinity for HPC ☆41 Updated 2 months ago
- A tracing infrastructure for heterogeneous computing applications. ☆33 Updated last week
- ☆40 Updated 2 weeks ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆57 Updated 2 months ago
- Performance Prediction Toolkit ☆52 Updated 7 months ago
- COCCL: Compression and precision co-aware collective communication library ☆24 Updated 4 months ago
- A GPU performance prediction toolkit for CUDA programs ☆17 Updated 6 years ago
- ytopt: machine-learning-based autotuning and hyperparameter optimization framework using Bayesian Optimization ☆49 Updated 3 weeks ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub… ☆59 Updated 2 weeks ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier ☆17 Updated last week
- CPU and GPU tutorial examples ☆13 Updated 3 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆24 Updated last month
- Benchmarks ☆17 Updated 2 months ago
- ☆45 Updated 4 years ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation ☆66 Updated last week
- ☆18 Updated last year
- An HPL-AI implementation for Fugaku ☆21 Updated 4 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks ☆23 Updated last year
- ☆17 Updated this week
- ☆10 Updated 3 months ago
- Advanced Profiling and Analytics for AMD Hardware ☆159 Updated this week
- A unified framework across multiple programming platforms ☆41 Updated last month
- Loop Kernel Analysis and Performance Modeling Toolkit ☆94 Updated 3 months ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆48 Updated 4 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆15 Updated 2 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆34 Updated last week
- A CUTLASS implementation using SYCL ☆30 Updated last week