essentialsofparallelcomputing / EssentialsOfParallelComputing
Main Book repository for the Parallel and High Performance Computing book, Manning Publications
☆198 · Updated 2 years ago
Alternatives and similar repositories for EssentialsOfParallelComputing:
Users who are interested in EssentialsOfParallelComputing are comparing it to the repositories listed below.
- Examples from Programming in Parallel with CUDA · ☆130 · Updated 2 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial · ☆247 · Updated last week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction" · ☆132 · Updated 4 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster · ☆663 · Updated last month
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… · ☆266 · Updated this week
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/) · ☆722 · Updated 7 months ago
- Future home of hpc-tutorials.llnl.gov · ☆234 · Updated 2 weeks ago
- ☆527 · Updated this week
- Training material for Nsight developer tools · ☆151 · Updated 7 months ago
- CUDA Kernel Benchmarking Library · ☆595 · Updated 2 weeks ago
- A set of hands-on tutorials for CUDA programming · ☆217 · Updated 11 months ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. … · ☆402 · Updated last year
- CUDA Matrix Multiplication Optimization · ☆173 · Updated 8 months ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… · ☆359 · Updated 2 weeks ago
- Example codes from the book Parallel Programming With OpenACC · ☆84 · Updated 8 years ago
- collection of benchmarks to measure basic GPU capabilities · ☆309 · Updated last month
- oneAPI Math Library (oneMath) · ☆657 · Updated this week
- Step-by-step optimization of CUDA SGEMM · ☆294 · Updated 2 years ago
- ☆431 · Updated 9 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems · ☆130 · Updated 4 years ago
- BLISlab: A Sandbox for Optimizing GEMM · ☆509 · Updated 3 years ago
- STREAM, for lots of devices written in many programming models · ☆330 · Updated 6 months ago
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem · ☆316 · Updated last month
- This is a set of simple programs that can be used to explore the features of a parallel platform. · ☆424 · Updated 2 weeks ago
- ☆233 · Updated this week
- CUDA Core Compute Libraries · ☆1,555 · Updated this week
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance. · ☆330 · Updated 2 months ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API · ☆76 · Updated last year
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm · ☆204 · Updated 3 months ago
- Learn OpenMP examples step by step · ☆91 · Updated 2 months ago