robertmaynard / code-samples

Source code examples from the Parallel Forall Blog

☆96

Alternatives and similar repositories for code-samples

Users who are interested in code-samples are comparing it to the libraries listed below.

myurtoglu / cudaforengineers
Source code repository for the projects from CUDA for Engineers
☆131 Updated 3 years ago
lukeyeager / cmake-cuda-example
Example of how to use CUDA with CMake >= 3.8
☆70 Updated last month
jeffhammond / dpcpp-tutorial
Intel Data Parallel C++ (and SYCL 2020) Tutorial.
☆94 Updated 3 years ago
cusplibrary / cusplibrary
CUSP : A C++ Templated Sparse Matrix Library
☆415 Updated this week
codeplaysoftware / SYCL-For-CUDA-Examples
Examples for using SYCL on CUDA
☆62 Updated last month
eyalroz / cuda-kat
CUDA kernel author's tools
☆113 Updated 3 years ago
GPMueller / eigen-cuda
MWE for using the Eigen library in CUDA kernels
☆119 Updated 2 years ago
PatWie / cuda-design-patterns
Some CUDA design patterns and a bit of template magic for CUDA
☆156 Updated 2 years ago
Apress / pro-TBB
Source Code for 'Today’s TBB: C++ Parallel Programming with Threading Building Blocks, Second Edition' by Michael Voss and James Reinder…
☆193 Updated 2 months ago
codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆262 Updated 6 months ago
llohse / libnpy
C++ library for reading and writing of numpy's .npy files
☆414 Updated 10 months ago
PhDP / cuda-cmake-gtest-gbench-starter
A cross-platform CUDA/C++17 starter project with google test and google benchmark support.
☆39 Updated 4 months ago
harrism / hemi
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
☆348 Updated 3 years ago
uestla / Sparse-Matrix
C++ implementation of sparse matrix using CRS (Compressed Row Storage) format
☆121 Updated 4 years ago
bryancatanzaro / trove
Full-speed Array of Structures access
☆172 Updated 2 years ago
tpn / cuda-samples
☆61 Updated 2 years ago
jrmadsen / PTL
Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…
☆48 Updated 8 months ago
codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆113 Updated last year
harrism / ranger
Generate simple index ranges in C++ and CUDA C++
☆39 Updated 2 years ago
roostaiyan / CudaSharedPtr
Shared pointer for CUDA device pointers and CUDA streams; a smart wrapper to allocate and deallocate CUDA device buffers.
☆1 Updated 2 years ago
OpenCL / OpenCLCXXPortingGuidelines
Set of guidelines for porting OpenCL™ C to OpenCL C++
☆41 Updated 8 years ago
cudpp / cudpp
CUDA Data Parallel Primitives Library
☆432 Updated 6 years ago
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆103 Updated 2 weeks ago
ndd314 / cuda_examples
☆68 Updated 11 years ago
clMathLibraries / clSPARSE
A software library containing sparse functions written in OpenCL
☆175 Updated 5 years ago
zchee / cuda-sample
CUDA official sample codes
☆372 Updated 9 years ago
ecrc / kblas-gpu
Subset of BLAS routines optimized for NVIDIA GPUs
☆71 Updated 2 years ago
HiPerCoRe / KTT
Kernel Tuning Toolkit
☆62 Updated last month
OpenMP / Examples
LaTeX Examples Document Source
☆244 Updated 7 months ago
mgopshtein / cudacpp
C++ convenience classes to be used with CUDA code, for both the host and the kernel parts.
☆55 Updated 6 years ago