Tuned OpenCL BLAS
☆1,173Apr 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for CLBlast
Users that are interested in CLBlast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a software library containing BLAS functions written in OpenCL☆864Aug 2, 2024Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial☆179Feb 7, 2017Updated 9 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆259Jan 13, 2025Updated last year
- A tool which profiles OpenCL devices to find their peak capacities☆487Apr 11, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP☆719Jul 19, 2025Updated 9 months ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆294Nov 22, 2021Updated 4 years ago
- OpenCL library to train deep convolutional neural networks☆880Jan 5, 2018Updated 8 years ago
- a software library containing Sparse functions written in OpenCL☆176Feb 21, 2020Updated 6 years ago
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆713Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆358Apr 1, 2026Updated 2 weeks ago
- a software library containing FFT functions written in OpenCL☆650Oct 5, 2022Updated 3 years ago
- pocl - Portable Computing Language☆1,061Apr 9, 2026Updated last week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,137Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Print all known information about all available OpenCL platforms and devices in the system☆372Dec 19, 2025Updated 4 months ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Apr 20, 2017Updated 8 years ago
- An OpenCL device simulator and debugger☆370Mar 24, 2026Updated 3 weeks ago
- A C++ GPU Computing Library for OpenCL☆1,650Mar 11, 2026Updated last month
- Khronos OpenCL-CLHPP☆416Feb 25, 2026Updated last month
- BLAS-like Library Instantiation Software Framework☆2,624Nov 11, 2025Updated 5 months ago
- ArrayFire: a general purpose GPU library.☆4,881Mar 7, 2026Updated last month
- JOCLBlast - Java bindings for CLBlast☆15Mar 14, 2021Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end☆56Oct 8, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,384Updated this week
- an OpenCL based software library containing random number generation functions☆136Nov 19, 2021Updated 4 years ago
- Khronos OpenCL-Headers☆754Mar 17, 2026Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆391Updated this week
- Implementation of OpenCL 3.0 on Vulkan☆429Updated this week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆325Aug 11, 2023Updated 2 years ago
- Easy to run kernels using OpenCL☆188Apr 22, 2025Updated 11 months ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆456Nov 2, 2024Updated last year
- tutorial to optimize GEMM performance on android☆51Feb 17, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆876Apr 23, 2025Updated 11 months ago
- Low-precision matrix multiplication☆1,838Jan 29, 2024Updated 2 years ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,828Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆949Mar 18, 2026Updated last month
- The OpenCL ICD Loader project.☆296Mar 31, 2026Updated 2 weeks ago
- OpenCL SDK☆755Jan 27, 2026Updated 2 months ago
- Open Source Parallel STL implementation☆530Jan 26, 2024Updated 2 years ago