Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line
☆26Apr 26, 2026Updated this week
Alternatives and similar repositories for gpu-kernel-runner
Users that are interested in gpu-kernel-runner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆19Mar 8, 2024Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆57Mar 20, 2025Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆31Jun 26, 2024Updated last year
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Sep 9, 2025Updated 7 months ago
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 8 months ago
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- wxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowled…☆14Feb 10, 2026Updated 2 months ago
- Microbenchmark that unveals the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.☆14Dec 12, 2024Updated last year
- Massively Asynchronous Coding Environment☆18Oct 21, 2012Updated 13 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization☆12Jul 1, 2020Updated 5 years ago
- Ariston Net integration with home assistant☆10Nov 3, 2020Updated 5 years ago
- Double precision raytracer for scientific or engineering applications.☆12May 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆883Feb 16, 2026Updated 2 months ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- Parallel Bytecode Interpreter For Heterogeneous Hardware☆15Aug 27, 2021Updated 4 years ago
- libmpdata++ - a library of parallel MPDATA-based solvers for systems of generalised transport equations☆12Apr 15, 2026Updated 2 weeks ago
- Plane Wave Density Functional Theory Code for the GPU☆12Jan 23, 2015Updated 11 years ago
- CUDA kernel author's tools☆117Apr 24, 2022Updated 4 years ago
- Global Memory and Threading runtime system☆25Dec 10, 2025Updated 4 months ago
- A C++ linear algebra algebra focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications☆20Apr 16, 2024Updated 2 years ago
- A C++/CUDA library for loading CUDA sparse textures on demand in OptiX renderers☆14Jun 4, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Sep 29, 2021Updated 4 years ago
- ♨️ Highest Throughput EVM L2 PoC, ThreadSafe Execution ♨️☆16Mar 18, 2024Updated 2 years ago
- Discord Rich Presence for Hatsune Miku: Project DIVA Mega Mix+.☆14Feb 23, 2026Updated 2 months ago
- Generic procedures in Fortran☆23Jan 8, 2019Updated 7 years ago
- Tool to collect and visualize memory usage of a process tree, mainly for Windows.☆19Dec 5, 2024Updated last year
- CUDA executors☆14Dec 4, 2020Updated 5 years ago
- Implementation of generative semantic grammar.☆17Jun 2, 2022Updated 3 years ago
- A python code to study linear wave dynamics in two-dimensions☆14May 9, 2024Updated last year
- 🔮 High-performance kaleidoscope effects for real-time applications☆15Apr 1, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Gary Brandt Bucher, II☆14Oct 22, 2025Updated 6 months ago
- Standalone Patatrack pixel tracking☆18Aug 28, 2025Updated 8 months ago
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Apr 21, 2026Updated last week
- The cpp-framework is a c++ framework, Which includes http, thread pool, timer, json, database, encryption, reflection and so on.☆13Jan 6, 2019Updated 7 years ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆23Jan 11, 2024Updated 2 years ago
- A brain*** interpreter in C☆10Jan 21, 2023Updated 3 years ago
- Hands-on HPC I/O tutorial material☆18Oct 9, 2025Updated 6 months ago