Kernel Tuning Toolkit
☆69 · Mar 9, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for KTT
Users that are interested in KTT are comparing it to the libraries listed below.
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆185 · Dec 12, 2022 · Updated 3 years ago
- Kernel Tuner ☆389 · Mar 17, 2026 · Updated last week
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆33 · Mar 15, 2021 · Updated 5 years ago
- Implementation of cryptographic primitives in Go ☆13 · Mar 13, 2023 · Updated 3 years ago
- Work space for golang.org/x/perf version 2 ☆20 · Nov 14, 2020 · Updated 5 years ago
- A GPU benchmark suite for autotuners ☆19 · Feb 20, 2024 · Updated 2 years ago
- ☆14 · Mar 1, 2025 · Updated last year
- Slides and exercises for persistent memory programming tutorial ☆14 · Nov 14, 2022 · Updated 3 years ago
- TLS in Rust (eventually) ☆21 · Mar 12, 2013 · Updated 13 years ago
- Public proposals, extensions, information and materials from the SYCL working group ☆15 · Jan 26, 2024 · Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple… ☆37 · Dec 13, 2025 · Updated 3 months ago
- Julia wrapper of CLBlast, a "tuned OpenCL BLAS library". ☆14 · Aug 23, 2023 · Updated 2 years ago
- Prototype for a SPIR-V assembler and disassembler. It provides a composable Java interface for generating SPIR-V code at runtime. ☆13 · Oct 31, 2025 · Updated 4 months ago
- ☆34 · Nov 16, 2022 · Updated 3 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆259 · Jan 13, 2025 · Updated last year
- JOCLBlast - Java bindings for CLBlast ☆15 · Mar 14, 2021 · Updated 5 years ago
- ☆17 · Dec 8, 2023 · Updated 2 years ago
- Predict Performance of GPU Applications using analytical model and Machine Learning ☆11 · Aug 31, 2022 · Updated 3 years ago
- ☆24 · Jan 25, 2023 · Updated 3 years ago
- ☆17 · Feb 14, 2024 · Updated 2 years ago
- The Insieme Compiler and Runtime Infrastructure ☆35 · May 23, 2019 · Updated 6 years ago
- The SHOC Benchmark Suite ☆259 · Oct 6, 2025 · Updated 5 months ago
- Blue Brain Project nixpkgs configuration - Build a brain with Nix ☆20 · May 10, 2022 · Updated 3 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures ☆34 · Mar 19, 2023 · Updated 3 years ago
- Vulkan compute shader experiment ☆11 · Jan 13, 2021 · Updated 5 years ago
- parser script to process pytorch autograd profiler result, convert json file to excel. ☆15 · Oct 8, 2019 · Updated 6 years ago
- GPU Static Modeling using PTX and Deep Structured Learning ☆18 · Apr 1, 2020 · Updated 5 years ago
- Simple anomaly detection for univariate time series data. ☆11 · Jan 8, 2021 · Updated 5 years ago
- Pressio is Latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy… ☆16 · Dec 30, 2024 · Updated last year
- tools to create performance and roofline plots from measured data ☆61 · Jun 10, 2014 · Updated 11 years ago
- PRI Library ☆15 · Mar 21, 2024 · Updated 2 years ago
- Tuned OpenCL BLAS ☆1,168 · Feb 1, 2026 · Updated last month
- Fast and simple constant-time hashing to the BLS12-381 elliptic curve ☆44 · Mar 13, 2020 · Updated 6 years ago
- `@code_costs`: a variant of `@code_typed` with estimated costs ☆13 · Sep 1, 2020 · Updated 5 years ago
- Parallel Algorithms for Octree Meshing ☆12 · Dec 31, 2015 · Updated 10 years ago
- A portable GPU/CPU Path Tracer library powered by SYCL. (OpenCL/CUDA/OpenMP) ☆16 · Feb 19, 2019 · Updated 7 years ago
- SBLP 2025 MLIR Tutorial ☆72 · Feb 8, 2026 · Updated last month
- The vOW4SIKE project provides C code that implements the parallel collision search algorithm by van Oorschot and Wiener (vOW). The algori… ☆12 · May 25, 2021 · Updated 4 years ago
- An extensible framework for program autotuning ☆434 · Jan 29, 2026 · Updated last month