ekondis / mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
☆397Updated 3 months ago
Alternatives and similar repositories for mixbench:
Users that are interested in mixbench are comparing it to the libraries listed below
- CUDA Kernel Benchmarking Library☆629Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆237Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆441Updated 4 months ago
- Next generation BLAS implementation for ROCm platform☆367Updated this week
- oneAPI Collective Communications Library (oneCCL)☆232Updated last week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated last month
- HIPIFY: Convert CUDA to Portable C++ Code☆574Updated this week
- Examples for HIP☆205Updated 5 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- ROCm Communication Collectives Library (RCCL)☆330Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆393Updated this week
- collection of benchmarks to measure basic GPU capabilities☆369Updated 2 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- ROCm Parallel Primitives☆171Updated this week
- rocWMMA☆110Updated this week
- STREAM, for lots of devices written in many programming models☆334Updated 8 months ago
- ☆251Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆218Updated 3 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆697Updated 2 months ago
- AMD's graph optimization engine.☆216Updated this week
- ☆244Updated 2 months ago
- Advanced Profiling and Analytics for AMD Hardware☆152Updated this week
- ☆239Updated this week
- ☆131Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆144Updated last week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆269Updated this week
- ☆60Updated 4 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆328Updated this week
- A tool for bandwidth measurements on NVIDIA GPUs.☆413Updated 3 weeks ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago