NTNU-HPC-Lab / BAT
A GPU benchmark suite for autotuners
☆17Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for BAT
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆33Updated 2 years ago
- A tracing infrastructure for heterogeneous computing applications.☆25Updated last week
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆19Updated 6 months ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆32Updated 3 weeks ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆54Updated last week
- ☆47Updated 5 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆30Updated 3 months ago
- ☆41Updated 4 years ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆29Updated 2 months ago
- ytopt: machine-learning-based search methods for autotuning☆46Updated 3 weeks ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆57Updated 5 months ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆30Updated 3 weeks ago
- Emulating DMA Engines on GPUs for Performance and Portability☆34Updated 9 years ago
- Performance Prediction Toolkit☆51Updated 3 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆21Updated last year
- Loop Kernel Analysis and Performance Modeling Toolkit☆89Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated this week
- A unified framework across multiple programming platforms☆33Updated 5 months ago
- Logger for MPI communication☆26Updated last year
- Global Memory and Threading runtime system☆23Updated 6 months ago
- An HPL-AI implementation for Fugaku☆19Updated 3 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- Advanced Profiling and Analytics for AMD Hardware☆137Updated this week
- PIRA - Automatic Instrumentation Refinement☆15Updated 7 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆44Updated last month
- Slides and exercises for persistent memory programming tutorial☆11Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated 9 months ago