ekondis / mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
☆404 Updated 5 months ago
Alternatives and similar repositories for mixbench
Users who are interested in mixbench compare it to the repositories listed below.
- Examples for HIP ☆208 Updated 6 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆245 Updated this week
- CUDA Kernel Benchmarking Library ☆666 Updated last week
- Next generation BLAS implementation for ROCm platform ☆381 Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆258 Updated this week
- STREAM, for lots of devices written in many programming models ☆343 Updated 9 months ago
- A tool which profiles OpenCL devices to find their peak capacities ☆454 Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 5 months ago
- ☆247 Updated last week
- ROCm Communication Collectives Library (RCCL) ☆342 Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆587 Updated this week
- ROCm BLAS marshalling library ☆144 Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆148 Updated this week
- The SHOC Benchmark Suite ☆255 Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆105 Updated 7 years ago
- ☆140 Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆179 Updated 2 years ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆330 Updated 2 weeks ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆156 Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆227 Updated 3 weeks ago
- collection of benchmarks to measure basic GPU capabilities ☆384 Updated 4 months ago
- oneAPI Collective Communications Library (oneCCL) ☆237 Updated last week
- ☆261 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆173 Updated this week
- Assembler for NVIDIA Volta and Turing GPUs ☆221 Updated 3 years ago
- rocWMMA ☆115 Updated this week
- ☆255 Updated 2 weeks ago
- Advanced Profiling and Analytics for AMD Hardware ☆156 Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆423 Updated this week
- Next generation FFT implementation for ROCm ☆195 Updated this week