CLTune: An automatic OpenCL & CUDA kernel tuner
☆185 · Dec 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for CLTune
Users that are interested in CLTune are comparing it to the libraries listed below.
- Tuned OpenCL BLAS ☆1,173 · Apr 3, 2026 · Updated 2 weeks ago
- Kernel Tuning Toolkit ☆69 · Updated this week
- Kernel Tuner ☆389 · Updated this week
- A portable high-level API with CUDA or OpenCL back-end ☆56 · Oct 8, 2017 · Updated 8 years ago
- a software library containing BLAS functions written in OpenCL ☆864 · Aug 2, 2024 · Updated last year
- Set of OpenCL microbenchmarks ☆29 · Nov 19, 2025 · Updated 5 months ago
- Code appendix to an OpenCL matrix-multiplication tutorial ☆179 · Feb 7, 2017 · Updated 9 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆259 · Jan 13, 2025 · Updated last year
- A GPU benchmark suite for autotuners ☆19 · Feb 20, 2024 · Updated 2 years ago
- JOCLBlast - Java bindings for CLBlast ☆15 · Mar 14, 2021 · Updated 5 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 · Sep 12, 2025 · Updated 7 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆358 · Apr 1, 2026 · Updated 2 weeks ago
- An OpenCL device simulator and debugger ☆370 · Mar 24, 2026 · Updated 3 weeks ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆325 · Aug 11, 2023 · Updated 2 years ago
- The SHOC Benchmark Suite ☆259 · Oct 6, 2025 · Updated 6 months ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆294 · Nov 22, 2021 · Updated 4 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support. ☆87 · Aug 18, 2018 · Updated 7 years ago
- Print all known information about all available OpenCL platforms and devices in the system ☆372 · Dec 19, 2025 · Updated 4 months ago
- OpenCL for Visual Studio Code ☆39 · Feb 7, 2026 · Updated 2 months ago
- SkelCL is a library providing high-level abstractions for alleviated programming of modern parallel heterogeneous systems. SkelCL is a re… ☆30 · Sep 15, 2016 · Updated 9 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications ☆28 · Nov 27, 2016 · Updated 9 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆46 · Dec 9, 2019 · Updated 6 years ago
- tutorial to optimize GEMM performance on android ☆51 · Feb 17, 2016 · Updated 10 years ago
- OpenCL specific C++ libraries implemented in C++ for OpenCL kernel language published in releases of OpenCL-Docs ☆125 · Mar 6, 2023 · Updated 3 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆452 · Feb 7, 2026 · Updated 2 months ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Aug 15, 2019 · Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆137 · Apr 20, 2017 · Updated 8 years ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme… ☆22 · Apr 25, 2024 · Updated last year
- Clspv is a compiler for OpenCL C to Vulkan compute shaders ☆713 · Updated this week
- Automatically exported from code.google.com/p/freeocl ☆30 · Jan 6, 2018 · Updated 8 years ago
- Khronos OpenCL-CLHPP ☆416 · Feb 25, 2026 · Updated last month
- AES-based random number generator in C ☆11 · Apr 27, 2015 · Updated 10 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels ☆33 · Mar 15, 2021 · Updated 5 years ago
- C, C++ and Python Code for Exercises and Solutions ☆536 · Dec 17, 2019 · Updated 6 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆48 · Feb 10, 2015 · Updated 11 years ago
- an OpenCL based software library containing random number generation functions ☆136 · Nov 19, 2021 · Updated 4 years ago
- A simple utility to create user-specified git commit hashes ☆15 · Nov 24, 2025 · Updated 4 months ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 · May 21, 2024 · Updated last year
- pocl - Portable Computing Language ☆1,061 · Apr 9, 2026 · Updated last week