Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command line
☆24 · Nov 25, 2025 · Updated 3 months ago
Alternatives and similar repositories for gpu-kernel-runner
Users that are interested in gpu-kernel-runner are comparing it to the libraries listed below
- CUDA matrix computation library specialized for small matrix operations (3x3, 4x4, 1x3, 1x4, etc.). Including buffer ☆18 · Mar 8, 2024 · Updated last year
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- libmpdata++ - a library of parallel MPDATA-based solvers for systems of generalised transport equations ☆12 · Jan 13, 2026 · Updated last month
- C++ library for graph ordering ☆15 · Mar 20, 2020 · Updated 5 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization ☆12 · Jul 1, 2020 · Updated 5 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆31 · Jun 26, 2024 · Updated last year
- CUDA kernel author's tools ☆116 · Apr 24, 2022 · Updated 3 years ago
- Massively Asynchronous Coding Environment ☆18 · Oct 21, 2012 · Updated 13 years ago
- Plane Wave Density Functional Theory Code for the GPU ☆12 · Jan 23, 2015 · Updated 11 years ago
- A C++ linear algebra library focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications ☆20 · Apr 16, 2024 · Updated last year
- DARMA/magistrate => Serialization and checkpointing library ☆12 · Jan 26, 2026 · Updated last month
- 🎃 GPU load-balancing library for regular and irregular computations. ☆66 · Sep 9, 2025 · Updated 5 months ago
- Double precision raytracer for scientific or engineering applications. ☆12 · May 18, 2024 · Updated last year
- wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowled… ☆14 · Feb 10, 2026 · Updated 2 weeks ago
- CUDA executors ☆14 · Dec 4, 2020 · Updated 5 years ago
- Base container for developing C++ and Fortran HPC applications ☆18 · Jun 14, 2022 · Updated 3 years ago
- Fitting GLMMs with various approximation methods ☆15 · Jun 20, 2019 · Updated 6 years ago
- A Python code to study linear wave dynamics in two dimensions ☆14 · May 9, 2024 · Updated last year
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem ☆20 · Updated this week
- Standalone Patatrack pixel tracking ☆18 · Aug 28, 2025 · Updated 6 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆37 · Jan 30, 2026 · Updated last month
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs ☆15 · Apr 7, 2025 · Updated 10 months ago
- Generic procedures in Fortran ☆23 · Jan 8, 2019 · Updated 7 years ago
- Tools for mesh adaptation ☆23 · Dec 7, 2023 · Updated 2 years ago
- Header-only C++20 wrapper for MPI 4.0. ☆16 · Oct 20, 2023 · Updated 2 years ago
- A C++ program for high-precision atomic structure calculations of one and two valence systems. Uses Hartree-Fock + correlation potential … ☆24 · Updated this week
- A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows ☆16 · Sep 24, 2021 · Updated 4 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q… ☆48 · Nov 14, 2024 · Updated last year
- study of cutlass ☆22 · Nov 10, 2024 · Updated last year
- This is an example code based on a simple N-body simulation written in C++ which can be used to demonstrate the functionality of the Inte… ☆18 · Apr 26, 2021 · Updated 4 years ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆872 · Feb 16, 2026 · Updated last week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆57 · Mar 20, 2025 · Updated 11 months ago
- Asynchronous I/O for HDF5 ☆24 · Feb 10, 2026 · Updated 2 weeks ago
- A C++ library for working with particles and grids in a parallel setting. ☆20 · Dec 11, 2024 · Updated last year
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze ☆21 · Nov 18, 2019 · Updated 6 years ago
- GMDS, for Generic Mesh Data Structures and Services, provides a set of libraries to represent and handle meshes in the context of numeric… ☆31 · Feb 13, 2026 · Updated 2 weeks ago
- A parallel, fast solver for the scalar advection-diffusion and the incompressible Navier-Stokes equations based on semi-Lagrangian/Volume… ☆24 · Jan 28, 2019 · Updated 7 years ago
- Experimental Explicit Communications API for Kokkos ☆30 · Feb 20, 2026 · Updated last week
- Command line argument parser (C++14) ☆18 · Apr 11, 2019 · Updated 6 years ago