Runs a single CUDA/OpenCL kernel, taking its source from a file and its arguments from the command line
☆24Mar 15, 2026Updated this week
Alternatives and similar repositories for gpu-kernel-runner
Users that are interested in gpu-kernel-runner are comparing it to the libraries listed below
- CUDA matrix computation library specialized for small matrix operations (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆18Mar 8, 2024Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆57Mar 20, 2025Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆31Jun 26, 2024Updated last year
- This repository provides an implementation of the composable allocators described by Andrei Alexandrescu at CppCon 2015☆15Apr 29, 2018Updated 7 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Sep 9, 2025Updated 6 months ago
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- study of cutlass☆22Nov 10, 2024Updated last year
- wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowled…☆14Feb 10, 2026Updated last month
- Microbenchmark that unveils the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.☆14Dec 12, 2024Updated last year
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization☆12Jul 1, 2020Updated 5 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Mar 5, 2026Updated 2 weeks ago
- Ariston Net integration with Home Assistant☆10Nov 3, 2020Updated 5 years ago
- Double precision raytracer for scientific or engineering applications.☆12May 18, 2024Updated last year
- DARMA/magistrate => Serialization and checkpointing library☆12Jan 26, 2026Updated last month
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆877Feb 16, 2026Updated last month
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- Plane Wave Density Functional Theory Code for the GPU☆12Jan 23, 2015Updated 11 years ago
- CUDA kernel author's tools☆116Apr 24, 2022Updated 3 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- A C++ linear algebra library focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications☆20Apr 16, 2024Updated last year
- A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers☆14Jun 4, 2025Updated 9 months ago
- ☆12Sep 29, 2021Updated 4 years ago
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- Generic procedures in Fortran☆23Jan 8, 2019Updated 7 years ago
- Discord Rich Presence for Hatsune Miku: Project DIVA Mega Mix+.☆14Feb 23, 2026Updated 3 weeks ago
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem☆20Updated this week
- Tool to collect and visualize memory usage of a process tree, mainly for Windows.☆19Dec 5, 2024Updated last year
- A Python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- CUDA executors☆14Dec 4, 2020Updated 5 years ago
- Implementation of generative semantic grammar.☆17Jun 2, 2022Updated 3 years ago
- Python code to study linear wave dynamics in two dimensions☆14May 9, 2024Updated last year
- Gary Brandt Bucher, II☆14Oct 22, 2025Updated 4 months ago
- Standalone Patatrack pixel tracking☆18Aug 28, 2025Updated 6 months ago
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Mar 12, 2026Updated last week
- The cpp-framework is a C++ framework that includes HTTP, thread pool, timer, JSON, database, encryption, reflection and so on.☆13Jan 6, 2019Updated 7 years ago
- A Powerful AST Parser for Solidity☆10Nov 25, 2025Updated 3 months ago