CNugteren / CLBlast
Tuned OpenCL BLAS
☆1,090Updated 4 months ago
Alternatives and similar repositories for CLBlast:
Users that are interested in CLBlast are comparing it to the libraries listed below
- a software library containing BLAS functions written in OpenCL☆852Updated 7 months ago
- pocl - Portable Computing Language☆970Updated this week
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆435Updated 3 months ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,735Updated last year
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP☆706Updated 5 months ago
- a software library containing FFT functions written in OpenCL☆629Updated 2 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆867Updated this week
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆286Updated 3 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆531Updated last week
- HIPIFY: Convert CUDA to Portable C++ Code☆564Updated this week
- An OpenCL device simulator and debugger☆353Updated 6 months ago
- CUDA Kernel Benchmarking Library☆593Updated last week
- Khronos OpenCL-CLHPP☆392Updated 2 months ago
- oneAPI Math Library (oneMath)☆652Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆437Updated 4 years ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆326Updated 2 weeks ago
- The OpenCL ICD Loader project.☆257Updated 2 weeks ago
- AMD's Machine Intelligence Library☆1,130Updated this week
- Assembler for NVIDIA Maxwell architecture☆981Updated 2 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 2 months ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,560Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆389Updated 2 months ago
- A C++ GPU Computing Library for OpenCL☆1,589Updated last week
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆825Updated this week
- Patterns and behaviors for GPU computing☆1,707Updated 2 years ago
- a software library containing Sparse functions written in OpenCL☆173Updated 5 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆663Updated last month
- CUDA Data Parallel Primitives Library☆428Updated 6 years ago