jeffhammond / dpcpp-tutorial
Intel Data Parallel C++ (and SYCL 2020) Tutorial.
☆93 · Updated 3 years ago
Alternatives and similar repositories for dpcpp-tutorial
Users who are interested in dpcpp-tutorial are comparing it to the repositories listed below.
- SYCL Benchmark Suite ☆64 · Updated 4 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 · Updated 5 months ago
- Examples for using SYCL on CUDA ☆62 · Updated last week
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 · Updated last year
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆274 · Updated 2 months ago
- SYCL Open Source Specification ☆136 · Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation ☆30 · Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆120 · Updated last week
- SYCL Conformance Tests ☆70 · Updated this week
- RAJA Performance Suite ☆117 · Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆130 · Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆108 · Updated 2 years ago
- oneAPI Technical Advisory Board (TAB) Meeting Notes ☆72 · Updated last year
- Advanced Profiling and Analytics for AMD Hardware ☆156 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆84 · Updated this week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in… ☆25 · Updated 2 weeks ago
- Next generation LAPACK implementation for ROCm platform ☆102 · Updated this week
- oneAPI Level Zero Conformance & Performance test content ☆54 · Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆87 · Updated this week
- High-level C++ for Accelerator Clusters ☆145 · Updated last week
- STREAM, for lots of devices written in many programming models ☆343 · Updated 9 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆227 · Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆173 · Updated this week
- SYCL Reference Manual ☆28 · Updated last year
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Updated 2 years ago
- Training examples for SYCL ☆42 · Updated last month
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver ☆39 · Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆105 · Updated 7 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆51 · Updated this week
- DLA-Future ☆75 · Updated last month