JGU-HPC / parallelprogrammingbook
supplementary material/programming exercises
☆72 · Updated 3 years ago
Alternatives and similar repositories for parallelprogrammingbook
Users who are interested in parallelprogrammingbook are comparing it to the repositories listed below.
- A warp-oriented dynamic hash table for GPUs ☆73 · Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 · Updated 3 years ago
- Efficient SpGEMM on GPU using CUDA and CSR ☆54 · Updated last year
- BGHT: High-performance static GPU hash tables. ☆65 · Updated last month
- ☆44 · Updated 4 years ago
- tools to create performance and roofline plots from measured data ☆58 · Updated 10 years ago
- A light-weight MPI profiler. ☆95 · Updated 10 months ago
- A Library for fast Hash Tables on GPUs ☆119 · Updated 2 years ago
- TLB Benchmarks ☆34 · Updated 7 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks ☆23 · Updated last year
- Some CUDA design patterns and a bit of template magic for CUDA ☆154 · Updated 2 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆131 · Updated 5 years ago
- General Purpose Timing Library ☆34 · Updated last year
- Sparse matrix computation library for GPU ☆56 · Updated 4 years ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications ☆206 · Updated 3 years ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA ☆40 · Updated 6 years ago
- High-performance, GPU-aware communication library ☆87 · Updated 4 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆108 · Updated 2 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs ☆46 · Updated 9 years ago
- 🎃 GPU load-balancing library for regular and irregular computations. ☆62 · Updated 11 months ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear … ☆75 · Updated this week
- CSR5-based SpMV on CPUs, GPUs and Xeon Phi ☆101 · Updated 11 months ago
- ☆245 · Updated last week
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆28 · Updated 4 years ago
- ☆34 · Updated 5 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels ☆32 · Updated 4 years ago
- Examples for using SYCL on CUDA ☆62 · Updated 3 months ago
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Updated last year
- Subset of BLAS routines optimized for NVIDIA GPUs ☆68 · Updated 2 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA ☆32 · Updated 4 years ago