Apress / data-parallel-CPP
Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian (Apress, 2020).
☆269Updated 2 weeks ago
Alternatives and similar repositories for data-parallel-CPP:
Users that are interested in data-parallel-CPP are comparing it to the libraries listed below
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated last week
- ☆236Updated this week
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming☆481Updated 2 months ago
- STREAM, for lots of devices written in many programming models☆332Updated 7 months ago
- ☆250Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆327Updated last week
- oneAPI Level Zero Specification Headers and Loader☆255Updated this week
- SYCL Open Source Specification☆134Updated last week
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 10 months ago
- SYCL Benchmark Suite☆64Updated last month
- Examples for using SYCL on CUDA☆62Updated last month
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated last year
- Advanced Profiling and Analytics for AMD Hardware☆145Updated this week
- Examples for HIP☆204Updated 4 months ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆100Updated last week
- Source Code for `Today’s TBB: C++ Parallel Programming with Threading Building Blocks, Second Edition' by Michael Voss and James Reinder…☆183Updated 2 weeks ago
- oneAPI Math Library (oneMath)☆665Updated last week
- ROCm Parallel Primitives☆171Updated this week
- ☆20Updated 2 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated last month
- SYCL Conformance Tests☆69Updated this week
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆116Updated 5 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- CUDA Kernel Benchmarking Library☆618Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆235Updated this week
- ☆61Updated 3 months ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆253Updated 3 weeks ago
- Next generation LAPACK implementation for ROCm platform☆99Updated this week