ekondis / mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
☆363Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for mixbench
- Stretching GPU performance for GEMMs and tensor contractions.☆220Updated this week
- ROCm Communication Collectives Library (RCCL)☆267Updated this week
- Examples for HIP☆201Updated this week
- Next generation BLAS implementation for ROCm platform☆346Updated this week
- An implementation of BLAS using the SYCL open standard.☆259Updated last week
- A tool which profiles OpenCL devices to find their peak capacities☆409Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆201Updated last week
- CUDA Kernel Benchmarking Library☆513Updated 2 weeks ago
- HIPIFY: Convert CUDA to Portable C++ Code☆523Updated this week
- oneAPI Collective Communications Library (oneCCL)☆201Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆313Updated last week
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆223Updated this week
- ROCm Parallel Primitives☆161Updated this week
- The SHOC Benchmark Suite☆247Updated 2 years ago
- ☆215Updated this week
- oneAPI Level Zero Specification Headers and Loader☆218Updated last week
- AMD's graph optimization engine.☆185Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆309Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆200Updated 2 years ago
- ☆228Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆170Updated last year
- SYCL Open Source Specification☆114Updated this week
- rocWMMA☆91Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆127Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆135Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆517Updated 5 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆207Updated this week
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆107Updated last year