CLTune: An automatic OpenCL & CUDA kernel tuner
☆185 · Dec 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for CLTune
Users who are interested in CLTune compare it with the libraries listed below.
- Kernel Tuning Toolkit ☆68 · Updated this week
- Tuned OpenCL BLAS ☆1,168 · Feb 1, 2026 · Updated last month
- Kernel Tuner ☆388 · Updated this week
- A portable high-level API with CUDA or OpenCL back-end ☆56 · Oct 8, 2017 · Updated 8 years ago
- a software library containing BLAS functions written in OpenCL ☆865 · Aug 2, 2024 · Updated last year
- Set of OpenCL microbenchmarks ☆29 · Nov 19, 2025 · Updated 3 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆260 · Jan 13, 2025 · Updated last year
- Code appendix to an OpenCL matrix-multiplication tutorial ☆179 · Feb 7, 2017 · Updated 9 years ago
- An OpenCL device simulator and debugger ☆369 · Updated this week
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner ☆21 · Sep 12, 2025 · Updated 5 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆354 · Mar 2, 2026 · Updated last week
- A GPU benchmark suite for autotuners ☆19 · Feb 20, 2024 · Updated 2 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆294 · Nov 22, 2021 · Updated 4 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications ☆28 · Nov 27, 2016 · Updated 9 years ago
- This repository contains components that will support percolation via OpenCL and CUDA ☆33 · Jan 5, 2022 · Updated 4 years ago
- Print all known information about all available OpenCL platforms and devices in the system ☆372 · Dec 19, 2025 · Updated 2 months ago
- Development/testing repo for SWIG+Fortran ☆11 · Mar 25, 2018 · Updated 7 years ago
- The SHOC Benchmark Suite ☆260 · Oct 6, 2025 · Updated 5 months ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆325 · Aug 11, 2023 · Updated 2 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆450 · Feb 7, 2026 · Updated last month
- A pseudo random number generator library written against the SYCL API. ☆11 · Jun 11, 2019 · Updated 6 years ago
- Modular Expression Language for Ordinary Differential Equation Editing ☆12 · Nov 10, 2021 · Updated 4 years ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme… ☆22 · Apr 25, 2024 · Updated last year
- JOCLBlast - Java bindings for CLBlast ☆15 · Mar 14, 2021 · Updated 4 years ago
- Caffe: a fast open framework for deep learning. With OpenCL and CUDA support. ☆87 · Aug 18, 2018 · Updated 7 years ago
- CLAW Compiler for Performance Portability ☆42 · Dec 15, 2022 · Updated 3 years ago
- tutorial to optimize GEMM performance on android ☆51 · Feb 17, 2016 · Updated 10 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆46 · Dec 9, 2019 · Updated 6 years ago
- Implementation of the SYCL specification. ☆66 · Jun 19, 2024 · Updated last year
- cinema toolkit for large data analysis and visualization ☆13 · Sep 14, 2022 · Updated 3 years ago
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs ☆14 · Sep 26, 2023 · Updated 2 years ago
- Yaksa: High-performance Noncontiguous Data Management ☆15 · Oct 1, 2025 · Updated 5 months ago
- OpenCL for Visual Studio Code ☆39 · Feb 7, 2026 · Updated last month
- VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP ☆718 · Jul 19, 2025 · Updated 7 months ago
- List all available information about all SYCL devices and platforms ☆15 · Sep 14, 2020 · Updated 5 years ago
- Distributed Interactive Visualization and Exploration of large datasets ☆15 · May 11, 2016 · Updated 9 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL ☆19 · Mar 1, 2025 · Updated last year