Kernel Tuning Toolkit
☆70May 14, 2026Updated last week
Alternatives and similar repositories for KTT
Users that are interested in KTT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- Kernel Tuner☆397Updated this week
- Implementation of cryptographic primitives in Go☆13Mar 13, 2023Updated 3 years ago
- A GPU performance prediction toolkit for CUDA programs☆19Mar 25, 2019Updated 7 years ago
- High performance C++ Linear Algebra Library☆16Oct 12, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Work space for golang.org/x/perf version 2☆20Nov 14, 2020Updated 5 years ago
- A GPU benchmark suite for autotuners☆19Feb 20, 2024Updated 2 years ago
- ☆14Mar 1, 2025Updated last year
- Slides and exercises for persistent memory programming tutorial☆14Nov 14, 2022Updated 3 years ago
- Public proposals, extensions, information and materials from the SYCL working group☆15Jan 26, 2024Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Dec 13, 2025Updated 5 months ago
- Julia wrapper of CLBlast, a "tuned OpenCL BLAS library".☆14Aug 23, 2023Updated 2 years ago
- Prototype for a SPIR-V assembler and dissasembler. It provides a composable Java interface for generating SPIR-V code at runtime.☆14Oct 31, 2025Updated 6 months ago
- JOCLBlast - Java bindings for CLBlast☆15Mar 14, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆259Jan 13, 2025Updated last year
- ☆17Dec 8, 2023Updated 2 years ago
- ☆10Jan 21, 2021Updated 5 years ago
- Selects VGA from LINUX or EFI!☆11Feb 17, 2020Updated 6 years ago
- Research compiler based on algorithmic skeletons☆23Oct 18, 2014Updated 11 years ago
- Provides a Simple Way to Calculate ANOVAs From Fitted Linear Models.☆21Jun 10, 2024Updated last year
- ☆17Feb 14, 2024Updated 2 years ago
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 6 years ago
- A formally-verified provably-safe sandboxing Wasm-to-native compiler☆31Aug 30, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An OpenCL implementation of Connected Components Labeling based on "Connected Component Labeling in CUDA" from Onrej Stava, Bedrich Benes…☆12Apr 27, 2017Updated 9 years ago
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme…☆21Apr 25, 2024Updated 2 years ago
- I-D that describes the algorithm identifiers for NIST's PQC ML-DSA for use in the Internet X.509 Public Key Infrastructure☆14Oct 30, 2025Updated 6 months ago
- A simple utility to create user-specified git commit hashes☆15Nov 24, 2025Updated 6 months ago
- PIRA - Automatic Instrumentation Refinement☆17Mar 28, 2024Updated 2 years ago
- The SHOC Benchmark Suite☆260Oct 6, 2025Updated 7 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆34Mar 19, 2023Updated 3 years ago
- High Speed elliptic curve signature system using a 260-bit Granger Moss Prime.☆14Jun 3, 2021Updated 4 years ago
- parser script to process pytorch autograd profiler result, convert json file to excel.☆15Oct 8, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Solution to harden TLS security by storing private keys and delegating operations to the Trused Execution Environment☆13Oct 10, 2022Updated 3 years ago
- GPU Static Modeling using PTX and Deep Structured Learning☆19Apr 1, 2020Updated 6 years ago
- Pressio is latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy…☆16Dec 30, 2024Updated last year
- Simple anomaly detection for univariate time series data.☆11Jan 8, 2021Updated 5 years ago
- tools to create performance and roofline plots from measured data☆61Jun 10, 2014Updated 11 years ago
- ☆12Oct 19, 2014Updated 11 years ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆116Oct 10, 2023Updated 2 years ago