eyalroz/gpu-kernel-runner

eyalroz / gpu-kernel-runner

Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line

☆26

Alternatives and similar repositories for gpu-kernel-runner

Users that are interested in gpu-kernel-runner are comparing it to the libraries listed below.

AnkaChan / CuMatrix
CUDA matrix computation library specialized for small matrix operations (3x3, 4x4, 1x3, 1x4, etc.). Including buffer
☆20 · Mar 8, 2024 · Updated 2 years ago
ProjectPhysX / PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
☆59 · Mar 20, 2025 · Updated last year
eth-cscs / spla
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…
☆32 · Jun 26, 2024 · Updated 2 years ago
gunrock / loops
🎃 GPU load-balancing library for regular and irregular computations.
☆67 · Jun 25, 2026 · Updated 3 weeks ago
argonne-lcf / alcl
Argonne Leadership Computing Facility OpenCL tutorial
☆10 · Aug 22, 2025 · Updated 10 months ago
llnl / gecko
C++ library for graph ordering
☆15 · Mar 20, 2020 · Updated 6 years ago
ricosjp / allgebra
Base container for developing C++ and Fortran HPC applications
☆18 · Jun 14, 2022 · Updated 4 years ago
yester31 / Cutlass_EX
Study of CUTLASS
☆22 · Nov 10, 2024 · Updated last year
ye-luo / openmp-target
OpenMP offload playground
☆10 · Nov 16, 2024 · Updated last year
igfuw / libmpdataxx
libmpdata++ - a library of parallel MPDATA-based solvers for systems of generalised transport equations
☆12 · Jun 19, 2026 · Updated last month
bsc-performance-tools / paraver-kernel
wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowled…
☆16 · Feb 10, 2026 · Updated 5 months ago
JimZeyuYang / GPU_Power_Benchmark
Microbenchmark that unveils the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.
☆15 · Dec 12, 2024 · Updated last year
bytemaster / mace
Massively Asynchronous Coding Environment
☆18 · Oct 21, 2012 · Updated 13 years ago
hiddenSymmetries / mango
Multiprocessor Algorithms for Nonlinear Gradient-free Optimization
☆12 · Jul 1, 2020 · Updated 6 years ago
doctorvanmartin / homeassistant-ariston-sensor
Ariston Net integration with Home Assistant
☆10 · Nov 3, 2020 · Updated 5 years ago
cstatz / blazert
Double precision raytracer for scientific or engineering applications.
☆12 · May 18, 2024 · Updated 2 years ago
DARMA-tasking / magistrate
DARMA/magistrate => Serialization and checkpointing library
☆12 · Jan 26, 2026 · Updated 5 months ago
STEllAR-GROUP / blaze_cuda
WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze
☆21 · Nov 18, 2019 · Updated 6 years ago
eyalroz / cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
☆900 · Updated this week
beehive-lab / ProtonVM
Parallel Bytecode Interpreter For Heterogeneous Hardware
☆15 · Aug 27, 2021 · Updated 4 years ago
rehnd / dft-gpu
Plane Wave Density Functional Theory Code for the GPU
☆12 · Jan 23, 2015 · Updated 11 years ago
melvic-ybanez / ecena
A 3D Scene Renderer written in C++
☆17 · Mar 5, 2023 · Updated 3 years ago
roman-ellerbrock / QuTree
A C++ linear algebra library focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications
☆20 · Apr 16, 2024 · Updated 2 years ago
libocca / occa.py
OCCA Python API: JIT Compilation for Multiple Architectures
☆11 · Dec 20, 2019 · Updated 6 years ago
NVIDIA / otk-demand-loading
A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers
☆14 · Jun 4, 2025 · Updated last year
cms-patatrack / pixeltrack-standalone
Standalone Patatrack pixel tracking
☆18 · May 29, 2026 · Updated last month
nyotis / SpinXFormGPU
NVIDIA CUDA-accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…
☆17 · Mar 14, 2018 · Updated 8 years ago
matiaslindgren / cuda-memory-access-recorder
Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser
☆13 · Nov 17, 2020 · Updated 5 years ago
modern-fortran / generic-procedures
Generic procedures in Fortran
☆22 · Jan 8, 2019 · Updated 7 years ago
DenuvoSoftwareSolutions / Onlooker
Tool to collect and visualize memory usage of a process tree, mainly for Windows.
☆19 · Dec 5, 2024 · Updated last year
heogden / glmmsr
Fitting GLMMs with various approximation methods
☆15 · Jun 20, 2019 · Updated 7 years ago
Miyamura80 / SuperParallel
♨️ Highest Throughput EVM L2 PoC, ThreadSafe Execution ♨️
☆16 · Mar 18, 2024 · Updated 2 years ago
jaredhoberock / cudex
CUDA executors
☆14 · Dec 4, 2020 · Updated 5 years ago
pvthinker / wave2d
Python code to study linear wave dynamics in two dimensions
☆14 · Jun 15, 2026 · Updated last month
egecetin / libKaleidoscope
🔮 High-performance kaleidoscope effects for real-time applications
☆15 · Jul 1, 2026 · Updated 2 weeks ago
asaparov / grammar
Implementation of generative semantic grammar.
☆17 · Jun 2, 2022 · Updated 4 years ago
pprablanc / ppsrt
A Python algorithm to change the pitch of the voice in real time
☆13 · Dec 13, 2020 · Updated 5 years ago
VRGroupRWTH / mpi
Header-only C++20 wrapper for MPI 4.0.
☆16 · Oct 20, 2023 · Updated 2 years ago
brandtbucher / brandtbucher
Gary Brandt Bucher, II
☆14 · Oct 22, 2025 · Updated 8 months ago