Tuned OpenCL BLAS
☆1,179Apr 13, 2026Updated 2 months ago
Alternatives and similar repositories for CLBlast
Users that are interested in CLBlast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a software library containing BLAS functions written in OpenCL☆864Aug 2, 2024Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial☆179Feb 7, 2017Updated 9 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆259Jan 13, 2025Updated last year
- A synthetic micro-benchmark that measures peak compute, bandwidth, and matrix throughput of GPUs and CPUs☆498Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP☆721Jul 19, 2025Updated 10 months ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆295Nov 22, 2021Updated 4 years ago
- OpenCL library to train deep convolutional neural networks☆881Jan 5, 2018Updated 8 years ago
- a software library containing Sparse functions written in OpenCL☆176Feb 21, 2020Updated 6 years ago
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆718Jun 8, 2026Updated last week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆364Jun 3, 2026Updated 2 weeks ago
- a software library containing FFT functions written in OpenCL☆648Oct 5, 2022Updated 3 years ago
- pocl - Portable Computing Language☆1,073Jun 12, 2026Updated last week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,155Jun 11, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Print all known information about all available OpenCL platforms and devices in the system☆379Dec 19, 2025Updated 6 months ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Apr 20, 2017Updated 9 years ago
- An OpenCL device simulator and debugger☆372Mar 24, 2026Updated 2 months ago
- A C++ GPU Computing Library for OpenCL☆1,654Apr 22, 2026Updated last month
- Khronos OpenCL-CLHPP☆420May 29, 2026Updated 2 weeks ago
- BLAS-like Library Instantiation Software Framework☆2,645Nov 11, 2025Updated 7 months ago
- ArrayFire: a general purpose GPU library.☆4,889Mar 7, 2026Updated 3 months ago
- JOCLBlast - Java bindings for CLBlast☆16Mar 14, 2021Updated 5 years ago
- A portable high-level API with CUDA or OpenCL back-end☆56Oct 8, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,457Jun 12, 2026Updated last week
- an OpenCL based software library containing random number generation functions☆137Nov 19, 2021Updated 4 years ago
- Khronos OpenCL-Headers☆759Jun 4, 2026Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆396Updated this week
- Implementation of OpenCL 3.0 on Vulkan☆441May 6, 2026Updated last month
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆326Aug 11, 2023Updated 2 years ago
- Easy to run kernels using OpenCL☆188Apr 22, 2025Updated last year
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆456Nov 2, 2024Updated last year
- tutorial to optimize GEMM performance on android☆51Feb 17, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆877Apr 23, 2025Updated last year
- Low-precision matrix multiplication☆1,842Jan 29, 2024Updated 2 years ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,882Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆962Updated this week
- The OpenCL ICD Loader project.☆299Updated this week
- Open Source Parallel STL implementation☆531Jan 26, 2024Updated 2 years ago
- OpenCL SDK☆764Jun 3, 2026Updated 2 weeks ago