JGU-HPC / parallelprogrammingbook
supplementary material/programming exercises
☆73, updated 3 years ago
Alternatives and similar repositories for parallelprogrammingbook:
Users who are interested in parallelprogrammingbook are comparing it to the repositories listed below:
- ☆43, updated 4 years ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R ☆78, updated 8 months ago
- BGHT: High-performance static GPU hash tables. ☆63, updated 2 weeks ago
- Efficient SpGEMM on GPU using CUDA and CSR ☆52, updated last year
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆72, updated last month
- tools to create performance and roofline plots from measured data ☆58, updated 10 years ago
- A Library for fast Hash Tables on GPUs ☆115, updated 2 years ago
- Examples for using SYCL on CUDA ☆62, updated last month
- TLB Benchmarks ☆33, updated 7 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks ☆23, updated last year
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆107, updated last year
- ☆28, updated this week
- A warp-oriented dynamic hash table for GPUs ☆73, updated last year
- Some CUDA design patterns and a bit of template magic for CUDA ☆150, updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104, updated 7 years ago
- Full-speed Array of Structures access ☆169, updated 2 years ago
- ☆34, updated 5 years ago
- General Purpose Timing Library ☆34, updated 11 months ago
- A library of various helper routines and frameworks used by many of the lab's software ☆51, updated 11 months ago
- ☆29, updated 5 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures ☆50, updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93, updated 3 years ago
- cuASR: CUDA Algebra for Semirings ☆35, updated 2 years ago
- Source code examples from the Parallel Forall Blog ☆96, updated 6 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian… ☆26, updated 4 years ago
- A cross-platform CUDA/C++17 starter project with google test and google benchmark support. ☆37, updated last month
- Learn OpenMP examples step by step ☆91, updated 3 months ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language. ☆33, updated 5 years ago
- Parallel Graph Input Output ☆18, updated last year
- Subset of BLAS routines optimized for NVIDIA GPUs ☆68, updated 2 years ago