CNugteren / CLBlast
Tuned OpenCL BLAS
☆1,114 Updated last month
Alternatives and similar repositories for CLBlast
Users who are interested in CLBlast are comparing it to the libraries listed below.
- a software library containing BLAS functions written in OpenCL ☆856 Updated 10 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆879 Updated 2 weeks ago
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP ☆712 Updated 2 months ago
- A tool which profiles OpenCL devices to find their peak capacities ☆452 Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 5 months ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆288 Updated 3 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆179 Updated 2 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices ☆861 Updated last month
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆542 Updated 3 weeks ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆324 Updated last year
- oneAPI Math Library (oneMath) ☆687 Updated last week
- A C++ GPU Computing Library for OpenCL ☆1,615 Updated last month
- Next generation BLAS implementation for ROCm platform ☆381 Updated this week
- Open Source Parallel STL implementation ☆528 Updated last year
- An OpenCL device simulator and debugger ☆355 Updated 9 months ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,752 Updated last year
- Assembler for NVIDIA Maxwell architecture ☆1,005 Updated 2 years ago
- a software library containing Sparse functions written in OpenCL ☆175 Updated 5 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆445 Updated 7 months ago
- Low-precision matrix multiplication ☆1,805 Updated last year
- Khronos OpenCL-CLHPP ☆398 Updated last week
- pocl - Portable Computing Language ☆1,004 Updated this week
- a software library containing FFT functions written in OpenCL ☆636 Updated 2 years ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, … ☆1,653 Updated last week
- Code appendix to an OpenCL matrix-multiplication tutorial ☆171 Updated 8 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆404 Updated 5 months ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform ☆437 Updated 5 years ago
- CUSP : A C++ Templated Sparse Matrix Library ☆413 Updated last week
- Source code examples from the Parallel Forall Blog ☆1,290 Updated 10 months ago
- Patterns and behaviors for GPU computing ☆1,724 Updated 2 years ago