JGU-HPC / parallelprogrammingbook
supplementary material/programming exercises
☆74 · Updated 4 years ago
Alternatives and similar repositories for parallelprogrammingbook
Users who are interested in parallelprogrammingbook are comparing it to the libraries listed below.
- A Library for fast Hash Tables on GPUs ☆126 · Updated last month
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆95 · Updated 3 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆79 · Updated 3 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆113 · Updated 2 years ago
- Source code examples from the Parallel Forall Blog ☆96 · Updated 6 years ago
- A warp-oriented dynamic hash table for GPUs ☆76 · Updated last year
- Future home of hpc-tutorials.llnl.gov ☆249 · Updated 8 months ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆73 · Updated 2 years ago
- BGHT: High-performance static GPU hash tables. ☆72 · Updated 4 months ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R ☆83 · Updated last year
- Some CUDA design patterns and a bit of template magic for CUDA ☆156 · Updated 2 years ago
- tools to create performance and roofline plots from measured data ☆60 · Updated 11 years ago
- High-performance, GPU-aware communication library ☆86 · Updated 10 months ago
- Example codes from the book Parallel Programming With OpenACC ☆86 · Updated 8 years ago
- Learn OpenMP examples step by step ☆99 · Updated 10 months ago
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction" ☆149 · Updated 7 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆210 · Updated 3 weeks ago
- LaTeX Examples Document Source ☆251 · Updated last week
- ☆48 · Updated 5 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial ☆321 · Updated last week
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆219 · Updated 3 years ago
- Examples for using SYCL on CUDA ☆62 · Updated 2 months ago
- ☆267 · Updated last week
- General Purpose Timing Library ☆34 · Updated 3 months ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 · Updated 4 years ago
- Full-speed Array of Structures access ☆176 · Updated 2 years ago
- MagmaDNN: a simple deep learning framework in c++ ☆50 · Updated 5 years ago
- RAJA Performance Suite ☆125 · Updated this week
- Sparse matrix computation library for GPU ☆59 · Updated 5 years ago
- Efficient SpGEMM on GPU using CUDA and CSR ☆57 · Updated 2 years ago