ptheywood / cuda-cmake-github-actionsLinks

☆59

Alternatives and similar repositories for cuda-cmake-github-actions

Users that are interested in cuda-cmake-github-actions are comparing it to the libraries listed below

Sorting:

llohse / libnpy
C++ library for reading and writing of numpy's .npy files
☆414Updated 10 months ago
eyalroz / cuda-kat
CUDA kernel author's tools
☆113Updated 3 years ago
harrism / ranger
Generate simple index ranges in C++ and CUDA C++
☆39Updated 2 years ago
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆103Updated 2 weeks ago
Jimver / cuda-toolkit
GitHub Action to install CUDA
☆182Updated last week
PatWie / cuda-design-patterns
Some CUDA design patterns and a bit of template magic for CUDA
☆156Updated 2 years ago
codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆262Updated 6 months ago
milakov / int_fastdiv
Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.
☆71Updated 9 years ago
robertmaynard / code-samples
Source code examples from the Parallel Forall Blog
☆96Updated 6 years ago
ProjectPhysX / PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
☆55Updated 4 months ago
NVIDIA / jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
☆547Updated 2 weeks ago
codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆113Updated last year
MuGdxy / muda
μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin…
☆183Updated last month
owensgroup / BGHT
BGHT: High-performance static GPU hash tables.
☆70Updated last month
NVlabs / cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
☆84Updated last year
Ahdhn / CUDATemplate
Template for starting CUDA/C++ project using CMake with Github Action for CI
☆31Updated last month
bryancatanzaro / trove
Full-speed Array of Structures access
☆172Updated 2 years ago
HiPerCoRe / KTT
Kernel Tuning Toolkit
☆62Updated last month
mark-poscablo / gpu-prefix-sum
CUDA implementation of exclusive prefix sum via Blelloch's algorithm
☆28Updated 8 years ago
NVIDIA / cuCollections
☆561Updated this week
eyalroz / cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
☆853Updated this week
mattdean1 / cuda
An implementation of parallel exclusive scan in CUDA
☆62Updated 7 years ago
lukeyeager / cmake-cuda-example
Example of how to use CUDA with CMake >= 3.8
☆70Updated last month
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆427Updated 2 weeks ago
codeplaysoftware / SYCL-For-CUDA-Examples
Examples for using SYCL on CUDA
☆62Updated last month
Maratyszcza / FP16
Conversion to/from half-precision floating point formats
☆362Updated last year
jeffhammond / dpcpp-tutorial
Intel Data Parallel C++ (and SYCL 2020) Tutorial.
☆94Updated 3 years ago
ROCm / rocThrust
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆119Updated this week
enp1s0 / cutf
CUDA Template Functions
☆19Updated 7 months ago
PhDP / cuda-cmake-gtest-gbench-starter
A cross-platform CUDA/C++17 starter project with google test and google benchmark support.
☆39Updated 4 months ago