Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line
☆26Jun 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for gpu-kernel-runner
Users that are interested in gpu-kernel-runner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆20Mar 8, 2024Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆58Mar 20, 2025Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆32Jun 26, 2024Updated 2 years ago
- This repository provides implementation of composable allocators described by Andrei Alexandrescu on CppCon 2015☆15Apr 29, 2018Updated 8 years ago
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 4 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowled…☆15Feb 10, 2026Updated 4 months ago
- Microbenchmark that unveals the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.☆14Dec 12, 2024Updated last year
- Massively Asynchronous Coding Environment☆18Oct 21, 2012Updated 13 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization☆12Jul 1, 2020Updated 5 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Mar 5, 2026Updated 3 months ago
- Ariston Net integration with home assistant☆10Nov 3, 2020Updated 5 years ago
- Double precision raytracer for scientific or engineering applications.☆12May 18, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DARMA/magistrate => Serialization and checkpointing library☆12Jan 26, 2026Updated 5 months ago
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆891Jun 4, 2026Updated 3 weeks ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- CUDA kernel author's tools☆116Apr 24, 2022Updated 4 years ago
- Global Memory and Threading runtime system☆25Dec 10, 2025Updated 6 months ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- A C++ linear algebra algebra focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications☆20Apr 16, 2024Updated 2 years ago
- A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers☆14Jun 4, 2025Updated last year
- ☆12Sep 29, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- ♨️ Highest Throughput EVM L2 PoC, ThreadSafe Execution ♨️☆16Mar 18, 2024Updated 2 years ago
- Generic procedures in Fortran☆22Jan 8, 2019Updated 7 years ago
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem☆22Jun 18, 2026Updated last week
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- A python code to study linear wave dynamics in two-dimensions☆14Jun 15, 2026Updated 2 weeks ago
- Gary Brandt Bucher, II☆14Oct 22, 2025Updated 8 months ago
- Standalone Patatrack pixel tracking☆18May 29, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Apr 21, 2026Updated 2 months ago
- A translation of machine learning terms to Swedish☆10Nov 1, 2019Updated 6 years ago
- A brain*** interpreter in C☆10Jan 21, 2023Updated 3 years ago
- Hands-on HPC I/O tutorial material☆18Oct 9, 2025Updated 8 months ago
- MPI+Kokkos implementation of spectral difference method (SDM) high order schemes☆30Feb 2, 2025Updated last year
- Fitting GLMMs with various approximations methods☆15Jun 20, 2019Updated 7 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆47Nov 14, 2024Updated last year