CNugteren / CLBlast
Tuned OpenCL BLAS
☆1,071Updated 2 months ago
Alternatives and similar repositories for CLBlast:
Users that are interested in CLBlast are comparing it to the libraries listed below
- a software library containing BLAS functions written in OpenCL☆847Updated 5 months ago
- A tool which profiles OpenCL devices to find their peak capacities☆424Updated 3 weeks ago
- pocl - Portable Computing Language☆947Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆856Updated this week
- a software library containing FFT functions written in OpenCL☆628Updated 2 years ago
- Next generation BLAS implementation for ROCm platform☆355Updated this week
- Khronos OpenCL-Headers☆687Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆172Updated 2 years ago
- An OpenCL device simulator and debugger☆350Updated 4 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆262Updated this week
- Khronos OpenCL-CLHPP☆383Updated this week
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP☆705Updated 3 months ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆845Updated 6 months ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆283Updated 3 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,690Updated last year
- Patterns and behaviors for GPU computing☆1,690Updated 2 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆440Updated 2 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆537Updated this week
- A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework☆192Updated 4 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆372Updated this week
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆322Updated last year
- A C++ GPU Computing Library for OpenCL☆1,570Updated last month
- Code appendix to an OpenCL matrix-multiplication tutorial☆164Updated 7 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆519Updated 7 months ago
- oneAPI Math Library (oneMath)☆635Updated last week
- Assembler for NVIDIA Maxwell architecture☆963Updated 2 years ago
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,477Updated this week
- Open Source Parallel STL implementation☆519Updated 11 months ago
- AMD's Machine Intelligence Library☆1,095Updated this week