CLTune: An automatic OpenCL & CUDA kernel tuner
☆185 · Dec 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for CLTune
Users that are interested in CLTune are comparing it to the libraries listed below.
- Tuned OpenCL BLAS ☆1,179 · Apr 13, 2026 · Updated 2 months ago
- Kernel Tuning Toolkit ☆70 · Updated this week
- Kernel Tuner ☆398 · Jun 9, 2026 · Updated last week
- a software library containing BLAS functions written in OpenCL ☆864 · Aug 2, 2024 · Updated last year
- Set of OpenCL microbenchmarks ☆29 · Nov 19, 2025 · Updated 6 months ago
- Code appendix to an OpenCL matrix-multiplication tutorial ☆179 · Feb 7, 2017 · Updated 9 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆259 · Jan 13, 2025 · Updated last year
- A GPU benchmark suite for autotuners ☆19 · Feb 20, 2024 · Updated 2 years ago
- JOCLBlast - Java bindings for CLBlast ☆16 · Mar 14, 2021 · Updated 5 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 · Sep 12, 2025 · Updated 9 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆364 · Jun 3, 2026 · Updated 2 weeks ago
- An OpenCL device simulator and debugger ☆372 · Mar 24, 2026 · Updated 2 months ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆326 · Aug 11, 2023 · Updated 2 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆295 · Nov 22, 2021 · Updated 4 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support. ☆87 · Aug 18, 2018 · Updated 7 years ago
- Print all known information about all available OpenCL platforms and devices in the system ☆379 · Dec 19, 2025 · Updated 5 months ago
- OpenCL for Visual Studio Code ☆39 · Updated this week
- SkelCL is a library providing high-level abstractions for alleviated programming of modern parallel heterogeneous systems. SkelCL is a re… ☆30 · Sep 15, 2016 · Updated 9 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications ☆28 · Nov 27, 2016 · Updated 9 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆46 · Dec 9, 2019 · Updated 6 years ago
- tutorial to optimize GEMM performance on android ☆51 · Feb 17, 2016 · Updated 10 years ago
- OpenCL specific C++ libraries implemented in C++ for OpenCL kernel language published in releases of OpenCL-Docs ☆125 · Mar 6, 2023 · Updated 3 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆461 · May 31, 2026 · Updated 2 weeks ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Aug 15, 2019 · Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆137 · Apr 20, 2017 · Updated 9 years ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme… ☆21 · Apr 25, 2024 · Updated 2 years ago
- Clspv is a compiler for OpenCL C to Vulkan compute shaders ☆718 · Jun 8, 2026 · Updated last week
- Automatically exported from code.google.com/p/freeocl ☆30 · Jan 6, 2018 · Updated 8 years ago
- Khronos OpenCL-CLHPP ☆420 · May 29, 2026 · Updated 2 weeks ago
- AES-based random number generator in C ☆11 · Apr 27, 2015 · Updated 11 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels ☆33 · Mar 15, 2021 · Updated 5 years ago
- C, C++ and Python Code for Exercises and Solutions ☆536 · Dec 17, 2019 · Updated 6 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆49 · Feb 10, 2015 · Updated 11 years ago
- an OpenCL based software library containing random number generation functions ☆137 · Nov 19, 2021 · Updated 4 years ago
- Easy to run kernels using OpenCL ☆188 · Apr 22, 2025 · Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL ☆114 · May 21, 2024 · Updated 2 years ago
- pocl - Portable Computing Language ☆1,073 · Jun 12, 2026 · Updated last week
- GPUVerify: a Verifier for GPU Kernels ☆82 · Jul 28, 2022 · Updated 3 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix ☆22 · Oct 12, 2019 · Updated 6 years ago