essentialsofparallelcomputing / EssentialsOfParallelComputing
Main Book repository for the Parallel and High Performance Computing book, Manning Publications
☆176Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for EssentialsOfParallelComputing
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆187Updated this week
- Examples from Programming in Parallel with CUDA☆108Updated last year
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆558Updated 3 weeks ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆364Updated last year
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆252Updated last month
- Examples for using SYCL on CUDA☆60Updated 2 weeks ago
- ☆486Updated this week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆123Updated this week
- Example codes from the book Parallel Programming With OpenACC☆82Updated 7 years ago
- Training material for Nsight developer tools☆129Updated 3 months ago
- CUDA Kernel Benchmarking Library☆519Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆100Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆67Updated last year
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆128Updated 4 years ago
- oneAPI Math Kernel Library (oneMKL) Interfaces☆622Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- Step-by-step optimization of CUDA SGEMM☆240Updated 2 years ago
- ☆217Updated last week
- ☆393Updated 9 years ago
- Training materials provided by OpenACC.org.☆84Updated 3 months ago
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆75Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆196Updated 2 weeks ago
- Next generation LAPACK implementation for ROCm platform☆94Updated this week
- Future home of hpc-tutorials.llnl.gov☆224Updated 3 months ago
- Intermediate MPI lesson☆26Updated last year
- CUDA Core Compute Libraries☆1,278Updated this week
- CUDA Matrix Multiplication Optimization☆141Updated 4 months ago
- ☆231Updated this week
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆615Updated 3 months ago