ekondis / cl2-reduce-benchLinks
A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0
☆10Updated 5 years ago
Alternatives and similar repositories for cl2-reduce-bench
Users that are interested in cl2-reduce-bench are comparing it to the libraries listed below
Sorting:
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Updated 3 years ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆353Updated last week
- A tool which profiles OpenCL devices to find their peak capacities☆481Updated 2 months ago
- BLAS OpenCL implementation.☆16Updated 10 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆255Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆258Updated 3 weeks ago
- Kernel Tuning Toolkit☆67Updated 2 weeks ago
- ☆124Updated 13 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆260Updated last year
- The SHOC Benchmark Suite☆260Updated 4 months ago
- Code appendix to an OpenCL matrix-multiplication tutorial☆179Updated 9 years ago
- MIOpenGEMM is now deprecated☆61Updated 2 years ago
- STREAM, for lots of devices written in many programming models☆355Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆178Updated this week
- Examples for HIP☆214Updated last year
- Print all known information about all available OpenCL platforms and devices in the system☆371Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆270Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- CUDA accelerated(X) Multi-Precision library☆93Updated 9 years ago
- Python tools for NVIDIA Profiler☆21Updated 8 years ago
- ROCm Device Libraries☆96Updated last year
- ☆43Updated 2 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆448Updated last week
- Flexible GPGPU instrumentation☆89Updated 6 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Updated this week
- Tensor Tiling Library☆38Updated 4 months ago
- Efficient CUDA Stream Compaction Library☆35Updated 2 years ago
- ☆29Updated 3 years ago
- ☆34Updated 2 years ago
- ☆61Updated last year